{"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef date_range(start_date, end_date=None, num=None, delta=None):\n if not delta:\n return []\n if end_date and start_date > end_date:\n raise Exception(\"Wait. start_date needs to be before end_date\")\n if end_date and num:\n raise Exception(\"Wait. Either specify end_date OR num\")\n if not end_date and not num:\n end_date = timezone.utcnow()\n\n delta_iscron = False\n tz = start_date.tzinfo\n if isinstance(delta, six.string_types):\n delta_iscron = True\n start_date = timezone.make_naive(start_date, tz)\n cron = croniter(delta, start_date)\n elif isinstance(delta, timedelta):\n delta = abs(delta)\n dates = []\n if end_date:\n if timezone.is_naive(start_date):\n end_date = timezone.make_naive(end_date, tz)\n while start_date <= end_date:\n if timezone.is_naive(start_date):\n dates.append(timezone.make_aware(start_date, tz))\n else:\n dates.append(start_date)\n\n if delta_iscron:\n start_date = cron.get_next(datetime)\n else:\n start_date += delta\n else:\n for _ in range(abs(num)):\n if timezone.is_naive(start_date):\n dates.append(timezone.make_aware(start_date, tz))\n else:\n dates.append(start_date)\n\n if delta_iscron:\n if num > 0:\n start_date = cron.get_next(datetime)\n else:\n start_date = cron.get_prev(datetime)\n else:\n if num > 0:\n start_date += delta\n else:\n start_date -= delta\n return sorted(dates)", "response": "Return a list of dates based on a start date and end date."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef round_time(dt, delta, start_date=timezone.make_aware(datetime.min)):\n\n if isinstance(delta, six.string_types):\n # It's cron based, so it's easy\n tz = start_date.tzinfo\n start_date = timezone.make_naive(start_date, tz)\n cron = croniter(delta, start_date)\n prev = cron.get_prev(datetime)\n if prev == start_date:\n return timezone.make_aware(start_date, tz)\n else:\n return 
timezone.make_aware(prev, tz)\n\n # Ignore the microseconds of dt\n dt -= timedelta(microseconds=dt.microsecond)\n\n # We are looking for a datetime in the form start_date + i * delta\n # which is as close as possible to dt. Since delta could be a relative\n # delta we don't know its exact length in seconds so we cannot rely on\n # division to find i. Instead we employ a binary search algorithm, first\n # finding an upper and lower limit and then dissecting the interval until\n # we have found the closest match.\n\n # We first search an upper limit for i for which start_date + upper * delta\n # exceeds dt.\n upper = 1\n while start_date + upper * delta < dt:\n # To speed up finding an upper limit we grow this exponentially by a\n # factor of 2\n upper *= 2\n\n # Since upper is the first value for which start_date + upper * delta\n # exceeds dt, upper // 2 is below dt and therefore forms a lower limit\n # for the i we are looking for\n lower = upper // 2\n\n # We now continue to bisect the interval between\n # start_date + lower * delta and start_date + upper * delta\n # until we find the closest value\n while True:\n # Invariant: start + lower * delta < dt <= start + upper * delta\n # If start_date + (lower + 1)*delta exceeds dt, then either lower or\n # lower+1 has to be the solution we are searching for\n if start_date + (lower + 1) * delta >= dt:\n # Check if start_date + (lower + 1)*delta or\n # start_date + lower*delta is closer to dt and return the solution\n if (start_date + (lower + 1) * delta) - dt <= dt - (start_date + lower * delta):\n return start_date + (lower + 1) * delta\n else:\n return start_date + lower * delta\n\n # We bisect the interval and either replace the lower or upper\n # limit with the candidate\n candidate = lower + (upper - lower) // 2\n if start_date + candidate * delta >= dt:\n upper = candidate\n else:\n lower = candidate", "response": "Returns the datetime of the form start_date + i * delta which is closest to dt."} {"SOURCE": 
"codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef infer_time_unit(time_seconds_arr):\n if len(time_seconds_arr) == 0:\n return 'hours'\n max_time_seconds = max(time_seconds_arr)\n if max_time_seconds <= 60 * 2:\n return 'seconds'\n elif max_time_seconds <= 60 * 60 * 2:\n return 'minutes'\n elif max_time_seconds <= 24 * 60 * 60 * 2:\n return 'hours'\n else:\n return 'days'", "response": "Infer the most appropriate time unit for an array of time durations."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\nconverting an array of time durations in seconds to the specified time unit.", "response": "def scale_time_units(time_seconds_arr, unit):\n \"\"\"\n Convert an array of time durations in seconds to the specified time unit.\n \"\"\"\n if unit == 'minutes':\n return list(map(lambda x: x * 1.0 / 60, time_seconds_arr))\n elif unit == 'hours':\n return list(map(lambda x: x * 1.0 / (60 * 60), time_seconds_arr))\n elif unit == 'days':\n return list(map(lambda x: x * 1.0 / (24 * 60 * 60), time_seconds_arr))\n return time_seconds_arr"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef days_ago(n, hour=0, minute=0, second=0, microsecond=0):\n today = timezone.utcnow().replace(\n hour=hour,\n minute=minute,\n second=second,\n microsecond=microsecond)\n return today - timedelta(days=n)", "response": "Returns a datetime object representing n days ago. 
By default the time is midnight."} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\nreturns a list of Dag Runs for a specific DAG ID.", "response": "def get_dag_runs(dag_id, state=None):\n \"\"\"\n Returns a list of Dag Runs for a specific DAG ID.\n :param dag_id: String identifier of a DAG\n :param state: queued|running|success...\n :return: List of DAG runs of a DAG with requested state,\n or all runs if the state is not specified\n \"\"\"\n dagbag = DagBag()\n\n # Check DAG exists.\n if dag_id not in dagbag.dags:\n error_message = \"Dag id {} not found\".format(dag_id)\n raise AirflowException(error_message)\n\n dag_runs = list()\n state = state.lower() if state else None\n for run in DagRun.find(dag_id=dag_id, state=state):\n dag_runs.append({\n 'id': run.id,\n 'run_id': run.run_id,\n 'state': run.state,\n 'dag_id': run.dag_id,\n 'execution_date': run.execution_date.isoformat(),\n 'start_date': ((run.start_date or '') and\n run.start_date.isoformat()),\n 'dag_run_url': url_for('Airflow.graph', dag_id=run.dag_id,\n execution_date=run.execution_date)\n })\n\n return dag_runs"} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\ninitializing the role with the permissions and related view - menus.", "response": "def init_role(self, role_name, role_vms, role_perms):\n \"\"\"\n Initialize the role with the permissions and related view-menus.\n\n :param role_name:\n :param role_vms:\n :param role_perms:\n :return:\n \"\"\"\n pvms = self.get_session.query(sqla_models.PermissionView).all()\n pvms = [p for p in pvms if p.permission and p.view_menu]\n\n role = self.find_role(role_name)\n if not role:\n role = self.add_role(role_name)\n\n if len(role.permissions) == 0:\n self.log.info('Initializing permissions for role:%s in the database.', role_name)\n role_pvms = set()\n for pvm in pvms:\n if pvm.view_menu.name in role_vms and pvm.permission.name in role_perms:\n role_pvms.add(pvm)\n role.permissions = 
list(role_pvms)\n self.get_session.merge(role)\n self.get_session.commit()\n else:\n self.log.debug('Existing permissions for the role:%s '\n 'within the database will persist.', role_name)"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef delete_role(self, role_name):\n session = self.get_session\n role = session.query(sqla_models.Role)\\\n .filter(sqla_models.Role.name == role_name)\\\n .first()\n if role:\n self.log.info(\"Deleting role '%s'\", role_name)\n session.delete(role)\n session.commit()\n else:\n raise AirflowException(\"Role named '{}' does not exist\".format(\n role_name))", "response": "Delete the given role in the ab_role table."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\nget all the roles associated with the user.", "response": "def get_user_roles(self, user=None):\n \"\"\"\n Get all the roles associated with the user.\n\n :param user: the ab_user in FAB model.\n :return: a list of roles associated with the user.\n \"\"\"\n if user is None:\n user = g.user\n if user.is_anonymous:\n public_role = appbuilder.config.get('AUTH_ROLE_PUBLIC')\n return [appbuilder.security_manager.find_role(public_role)] \\\n if public_role else []\n return user.roles"} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\nreturn a set of tuples with the perm name and view menu name", "response": "def get_all_permissions_views(self):\n \"\"\"\n Returns a set of tuples with the perm name and view menu name\n \"\"\"\n perms_views = set()\n for role in self.get_user_roles():\n perms_views.update({(perm_view.permission.name, perm_view.view_menu.name)\n for perm_view in role.permissions})\n return perms_views"} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function for\nreturning a set of all dags that the user can access to.", "response": "def get_accessible_dag_ids(self, username=None):\n \"\"\"\n Return a set of dags that user has 
access to (either read or write).\n\n :param username: Name of the user.\n :return: A set of dag ids that the user could access.\n \"\"\"\n if not username:\n username = g.user\n\n if username.is_anonymous or 'Public' in username.roles:\n # return an empty set if the role is public\n return set()\n\n roles = {role.name for role in username.roles}\n if {'Admin', 'Viewer', 'User', 'Op'} & roles:\n return self.DAG_VMS\n\n user_perms_views = self.get_all_permissions_views()\n # return a set of all dags that the user could access\n return set([view for perm, view in user_perms_views if perm in self.DAG_PERMS])"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef has_access(self, permission, view_name, user=None):\n if not user:\n user = g.user\n if user.is_anonymous:\n return self.is_item_public(permission, view_name)\n return self._has_view_access(user, permission, view_name)", "response": "Verify whether a given user has a certain permission on the dag_id."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _has_role(self, role_name_or_list):\n if not isinstance(role_name_or_list, list):\n role_name_or_list = [role_name_or_list]\n return any(\n [r.name in role_name_or_list for r in self.get_user_roles()])", "response": "Returns True if the user has the role name_or_list."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _has_perm(self, permission_name, view_menu_name):\n if hasattr(self, 'perms'):\n if (permission_name, view_menu_name) in self.perms:\n return True\n # rebuild the permissions set\n self._get_and_cache_perms()\n return (permission_name, view_menu_name) in self.perms", "response": "Returns True if the user has permission with the given name."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef clean_perms(self):\n 
self.log.debug('Cleaning faulty perms')\n sesh = self.get_session\n pvms = (\n sesh.query(sqla_models.PermissionView)\n .filter(or_(\n sqla_models.PermissionView.permission == None, # NOQA\n sqla_models.PermissionView.view_menu == None, # NOQA\n ))\n )\n deleted_count = pvms.delete()\n sesh.commit()\n if deleted_count:\n self.log.info('Deleted %s faulty permissions', deleted_count)", "response": "FAB leaves faulty permissions that need to be cleaned up"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\nmerging the permission and view_menu into the permission_view_role.", "response": "def _merge_perm(self, permission_name, view_menu_name):\n \"\"\"\n Add the new permission , view_menu to ab_permission_view_role if not exists.\n It will add the related entry to ab_permission\n and ab_view_menu two meta tables as well.\n\n :param permission_name: Name of the permission.\n :type permission_name: str\n :param view_menu_name: Name of the view-menu\n :type view_menu_name: str\n :return:\n \"\"\"\n permission = self.find_permission(permission_name)\n view_menu = self.find_view_menu(view_menu_name)\n pv = None\n if permission and view_menu:\n pv = self.get_session.query(self.permissionview_model).filter_by(\n permission=permission, view_menu=view_menu).first()\n if not pv and permission_name and view_menu_name:\n self.add_permission_view_menu(permission_name, view_menu_name)"} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef create_custom_dag_permission_view(self, session=None):\n self.log.debug('Fetching a set of all permission, view_menu from FAB meta-table')\n\n def merge_pv(perm, view_menu):\n \"\"\"Create permission view menu only if it doesn't exist\"\"\"\n if view_menu and perm and (view_menu, perm) not in all_pvs:\n self._merge_perm(perm, view_menu)\n\n all_pvs = set()\n for pv in self.get_session.query(self.permissionview_model).all():\n if pv.permission and pv.view_menu:\n 
all_pvs.add((pv.permission.name, pv.view_menu.name))\n\n # Get all the active / paused dags and insert them into a set\n all_dags_models = session.query(models.DagModel)\\\n .filter(or_(models.DagModel.is_active, models.DagModel.is_paused))\\\n .filter(~models.DagModel.is_subdag).all()\n\n # create can_dag_edit and can_dag_read permissions for every dag(vm)\n for dag in all_dags_models:\n for perm in self.DAG_PERMS:\n merge_pv(perm, dag.dag_id)\n\n # for all the dag-level role, add the permission of viewer\n # with the dag view to ab_permission_view\n all_roles = self.get_all_roles()\n user_role = self.find_role('User')\n\n dag_role = [role for role in all_roles if role.name not in EXISTING_ROLES]\n update_perm_views = []\n\n # need to remove all_dag vm from all the existing view-menus\n dag_vm = self.find_view_menu('all_dags')\n ab_perm_view_role = sqla_models.assoc_permissionview_role\n perm_view = self.permissionview_model\n view_menu = self.viewmenu_model\n\n all_perm_view_by_user = session.query(ab_perm_view_role)\\\n .join(perm_view, perm_view.id == ab_perm_view_role\n .columns.permission_view_id)\\\n .filter(ab_perm_view_role.columns.role_id == user_role.id)\\\n .join(view_menu)\\\n .filter(perm_view.view_menu_id != dag_vm.id)\n all_perm_views = set([role.permission_view_id for role in all_perm_view_by_user])\n\n for role in dag_role:\n # Get all the perm-view of the role\n existing_perm_view_by_user = self.get_session.query(ab_perm_view_role)\\\n .filter(ab_perm_view_role.columns.role_id == role.id)\n\n existing_perms_views = set([pv.permission_view_id\n for pv in existing_perm_view_by_user])\n missing_perm_views = all_perm_views - existing_perms_views\n\n for perm_view_id in missing_perm_views:\n update_perm_views.append({'permission_view_id': perm_view_id,\n 'role_id': role.id})\n\n if update_perm_views:\n self.get_session.execute(ab_perm_view_role.insert(), update_perm_views)\n self.get_session.commit()", "response": "Create a custom permission view for 
all the dags."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef update_admin_perm_view(self):\n pvms = self.get_session.query(sqla_models.PermissionView).all()\n pvms = [p for p in pvms if p.permission and p.view_menu]\n\n admin = self.find_role('Admin')\n admin.permissions = list(set(admin.permissions) | set(pvms))\n\n self.get_session.commit()", "response": "Update the admin permissions table for the admin role."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function for\nsynchronizing user roles with the default and custom roles.", "response": "def sync_roles(self):\n \"\"\"\n 1. Init the default role(Admin, Viewer, User, Op, public)\n with related permissions.\n 2. Init the custom role(dag-user) with related permissions.\n\n :return: None.\n \"\"\"\n self.log.debug('Start syncing user roles.')\n # Create global all-dag VM\n self.create_perm_vm_for_all_dag()\n\n # Create default user role.\n for config in self.ROLE_CONFIGS:\n role = config['role']\n vms = config['vms']\n perms = config['perms']\n self.init_role(role, vms, perms)\n self.create_custom_dag_permission_view()\n\n # init existing roles, the rest role could be created through UI.\n self.update_admin_perm_view()\n self.clean_perms()"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef sync_perm_for_dag(self, dag_id, access_control=None):\n for dag_perm in self.DAG_PERMS:\n perm_on_dag = self.find_permission_view_menu(dag_perm, dag_id)\n if perm_on_dag is None:\n self.add_permission_view_menu(dag_perm, dag_id)\n\n if access_control:\n self._sync_dag_view_permissions(dag_id, access_control)", "response": "Sync permissions for a dag."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _sync_dag_view_permissions(self, dag_id, access_control):\n def _get_or_create_dag_permission(perm_name):\n dag_perm = 
self.find_permission_view_menu(perm_name, dag_id)\n if not dag_perm:\n self.log.info(\n \"Creating new permission '%s' on view '%s'\",\n perm_name, dag_id\n )\n dag_perm = self.add_permission_view_menu(perm_name, dag_id)\n\n return dag_perm\n\n def _revoke_stale_permissions(dag_view):\n existing_dag_perms = self.find_permissions_view_menu(dag_view)\n for perm in existing_dag_perms:\n non_admin_roles = [role for role in perm.role\n if role.name != 'Admin']\n for role in non_admin_roles:\n target_perms_for_role = access_control.get(role.name, {})\n if perm.permission.name not in target_perms_for_role:\n self.log.info(\n \"Revoking '%s' on DAG '%s' for role '%s'\",\n perm.permission, dag_id, role.name\n )\n self.del_permission_role(role, perm)\n\n dag_view = self.find_view_menu(dag_id)\n if dag_view:\n _revoke_stale_permissions(dag_view)\n\n for rolename, perms in access_control.items():\n role = self.find_role(rolename)\n if not role:\n raise AirflowException(\n \"The access_control mapping for DAG '{}' includes a role \"\n \"named '{}', but that role does not exist\".format(\n dag_id,\n rolename))\n\n perms = set(perms)\n invalid_perms = perms - self.DAG_PERMS\n if invalid_perms:\n raise AirflowException(\n \"The access_control map for DAG '{}' includes the following \"\n \"invalid permissions: {}; The set of valid permissions \"\n \"is: {}\".format(dag_id,\n (perms - self.DAG_PERMS),\n self.DAG_PERMS))\n\n for perm_name in perms:\n dag_perm = _get_or_create_dag_permission(perm_name)\n self.add_permission_role(role, dag_perm)", "response": "Synchronizes the permissions of a DAG to the view model."} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\ncreates perm - vm for all - dags.", "response": "def create_perm_vm_for_all_dag(self):\n \"\"\"\n Create perm-vm if not exist and insert into FAB security model for all-dags.\n \"\"\"\n # create perm for global logical dag\n for dag_vm in self.DAG_VMS:\n for perm in 
self.DAG_PERMS:\n self._merge_perm(permission_name=perm,\n view_menu_name=dag_vm)"} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef get_fernet():\n global _fernet\n log = LoggingMixin().log\n\n if _fernet:\n return _fernet\n try:\n from cryptography.fernet import Fernet, MultiFernet, InvalidToken\n global InvalidFernetToken\n InvalidFernetToken = InvalidToken\n\n except BuiltinImportError:\n log.warning(\n \"cryptography not found - values will not be stored encrypted.\"\n )\n _fernet = NullFernet()\n return _fernet\n\n try:\n fernet_key = configuration.conf.get('core', 'FERNET_KEY')\n if not fernet_key:\n log.warning(\n \"empty cryptography key - values will not be stored encrypted.\"\n )\n _fernet = NullFernet()\n else:\n _fernet = MultiFernet([\n Fernet(fernet_part.encode('utf-8'))\n for fernet_part in fernet_key.split(',')\n ])\n _fernet.is_encrypted = True\n except (ValueError, TypeError) as ve:\n raise AirflowException(\"Could not create Fernet object: {}\".format(ve))\n\n return _fernet", "response": "Returns a Fernet object for the given cryptography key."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef poke(self, context):\n if '.' in self.table_name:\n self.database_name, self.table_name = self.table_name.split('.')\n self.log.info(\n 'Poking for table %s. 
%s, expression %s', self.database_name, self.table_name, self.expression\n )\n\n return self.get_hook().check_for_partition(\n self.database_name, self.table_name, self.expression)", "response": "Checks if the partition exists in the AWS Glue Catalog table"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef get_hook(self):\n if not hasattr(self, 'hook'):\n from airflow.contrib.hooks.aws_glue_catalog_hook import AwsGlueCatalogHook\n self.hook = AwsGlueCatalogHook(\n aws_conn_id=self.aws_conn_id,\n region_name=self.region_name)\n\n return self.hook", "response": "Gets the AwsGlueCatalogHook object"} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\ncheck for message on subscribed queue and write to xcom", "response": "def poke(self, context):\n \"\"\"\n Check for message on subscribed queue and write to xcom the message with key ``messages``\n\n :param context: the context object\n :type context: dict\n :return: ``True`` if message is available or ``False``\n \"\"\"\n\n sqs_hook = SQSHook(aws_conn_id=self.aws_conn_id)\n sqs_conn = sqs_hook.get_conn()\n\n self.log.info('SQSSensor checking for message on queue: %s', self.sqs_queue)\n\n messages = sqs_conn.receive_message(QueueUrl=self.sqs_queue,\n MaxNumberOfMessages=self.max_messages,\n WaitTimeSeconds=self.wait_time_seconds)\n\n self.log.info(\"received message %s\", str(messages))\n\n if 'Messages' in messages and len(messages['Messages']) > 0:\n\n entries = [{'Id': message['MessageId'], 'ReceiptHandle': message['ReceiptHandle']}\n for message in messages['Messages']]\n\n result = sqs_conn.delete_message_batch(QueueUrl=self.sqs_queue,\n Entries=entries)\n\n if 'Successful' in result:\n context['ti'].xcom_push(key='messages', value=messages)\n return True\n else:\n raise AirflowException(\n 'Delete SQS Messages failed ' + str(result) + ' for messages ' + str(messages))\n\n return False"} {"SOURCE": "codesearchnet", "instruction": 
"Given the following Python 3 function, write the documentation\ndef tmp_configuration_copy(chmod=0o600):\n cfg_dict = conf.as_dict(display_sensitive=True, raw=True)\n temp_fd, cfg_path = mkstemp()\n\n with os.fdopen(temp_fd, 'w') as temp_file:\n if chmod is not None:\n os.fchmod(temp_fd, chmod)\n json.dump(cfg_dict, temp_file)\n\n return cfg_path", "response": "Returns a path to a temporary file including a full copy of the configuration\n settings."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef get_conn(self):\n # When using HAClient, proxy_user must be the same, so is ok to always\n # take the first.\n effective_user = self.proxy_user\n autoconfig = self.autoconfig\n use_sasl = configuration.conf.get('core', 'security') == 'kerberos'\n\n try:\n connections = self.get_connections(self.hdfs_conn_id)\n\n if not effective_user:\n effective_user = connections[0].login\n if not autoconfig:\n autoconfig = connections[0].extra_dejson.get('autoconfig',\n False)\n hdfs_namenode_principal = connections[0].extra_dejson.get(\n 'hdfs_namenode_principal')\n except AirflowException:\n if not autoconfig:\n raise\n\n if autoconfig:\n # will read config info from $HADOOP_HOME conf files\n client = AutoConfigClient(effective_user=effective_user,\n use_sasl=use_sasl)\n elif len(connections) == 1:\n client = Client(connections[0].host, connections[0].port,\n effective_user=effective_user, use_sasl=use_sasl,\n hdfs_namenode_principal=hdfs_namenode_principal)\n elif len(connections) > 1:\n nn = [Namenode(conn.host, conn.port) for conn in connections]\n client = HAClient(nn, effective_user=effective_user,\n use_sasl=use_sasl,\n hdfs_namenode_principal=hdfs_namenode_principal)\n else:\n raise HDFSHookException(\"conn_id doesn't exist in the repository \"\n \"and autoconfig is not specified\")\n\n return client", "response": "Returns a snakebite HDFSClient object."} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 
3 to\nestablish a connection depending on the security mode set via config or environment variable.", "response": "def get_conn(self):\n \"\"\"\n Establishes a connection depending on the security mode set via config or environment variable.\n\n :return: a hdfscli InsecureClient or KerberosClient object.\n :rtype: hdfs.InsecureClient or hdfs.ext.kerberos.KerberosClient\n \"\"\"\n connections = self.get_connections(self.webhdfs_conn_id)\n\n for connection in connections:\n try:\n self.log.debug('Trying namenode %s', connection.host)\n client = self._get_client(connection)\n client.status('/')\n self.log.debug('Using namenode %s for hook', connection.host)\n return client\n except HdfsError as hdfs_error:\n self.log.debug('Read operation on namenode %s failed with error: %s',\n connection.host, hdfs_error)\n\n hosts = [connection.host for connection in connections]\n error_message = 'Read operations failed on the namenodes below:\\n{hosts}'.format(\n hosts='\\n'.join(hosts))\n raise AirflowWebHDFSHookException(error_message)"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef check_for_path(self, hdfs_path):\n conn = self.get_conn()\n\n status = conn.status(hdfs_path, strict=False)\n return bool(status)", "response": "Checks if a path exists in HDFS by querying FileStatus. 
"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\nestablishing a connection to pinot broker through pinot dbapi.", "response": "def get_conn(self):\n \"\"\"\n Establish a connection to pinot broker through pinot dbapi.\n \"\"\"\n conn = self.get_connection(self.pinot_broker_conn_id)\n pinot_broker_conn = connect(\n host=conn.host,\n port=conn.port,\n path=conn.extra_dejson.get('endpoint', '/pql'),\n scheme=conn.extra_dejson.get('schema', 'http')\n )\n self.log.info('Get the connection to pinot '\n 'broker on {host}'.format(host=conn.host))\n return pinot_broker_conn"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\nget the connection uri for the current object.", "response": "def get_uri(self):\n \"\"\"\n Get the connection uri for pinot broker.\n\n e.g.: http://localhost:9000/pql\n \"\"\"\n conn = self.get_connection(getattr(self, self.conn_name_attr))\n host = conn.host\n if conn.port is not None:\n host += ':{port}'.format(port=conn.port)\n conn_type = 'http' if not conn.conn_type else conn.conn_type\n endpoint = conn.extra_dejson.get('endpoint', 'pql')\n return '{conn_type}://{host}/{endpoint}'.format(\n conn_type=conn_type, host=host, endpoint=endpoint)"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef get_records(self, sql):\n with self.get_conn() as cur:\n cur.execute(sql)\n return cur.fetchall()", "response": "Executes the sql and returns a set of records."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function for\nexecuting the sql and returning the first result.", "response": "def get_first(self, sql):\n \"\"\"\n Executes the sql and returns the first resulting row.\n\n :param sql: the sql statement to be executed (str) or a list of\n sql statements to execute\n :type sql: str or list\n \"\"\"\n with self.get_conn() as cur:\n cur.execute(sql)\n return cur.fetchone()"} {"SOURCE": "codesearchnet", "instruction": "Given 
the following Python 3 function, write the documentation\ndef smart_truncate(string, max_length=0, word_boundary=False, separator=' ', save_order=False):\n\n string = string.strip(separator)\n\n if not max_length:\n return string\n\n if len(string) < max_length:\n return string\n\n if not word_boundary:\n return string[:max_length].strip(separator)\n\n if separator not in string:\n return string[:max_length]\n\n truncated = ''\n for word in string.split(separator):\n if word:\n next_len = len(truncated) + len(word)\n if next_len < max_length:\n truncated += '{0}{1}'.format(word, separator)\n elif next_len == max_length:\n truncated += '{0}'.format(word)\n break\n else:\n if save_order:\n break\n if not truncated: # pragma: no cover\n truncated = string[:max_length]\n return truncated.strip(separator)", "response": "Truncates a string to a maximum length."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef slugify(text, entities=True, decimal=True, hexadecimal=True, max_length=0, word_boundary=False,\n separator=DEFAULT_SEPARATOR, save_order=False, stopwords=(), regex_pattern=None, lowercase=True,\n replacements=()):\n \"\"\"\n Make a slug from the given text.\n :param text (str): initial text\n :param entities (bool):\n :param decimal (bool):\n :param hexadecimal (bool):\n :param max_length (int): output string length\n :param word_boundary (bool):\n :param save_order (bool): if parameter is True and max_length > 0 return whole words in the initial order\n :param separator (str): separator between words\n :param stopwords (iterable): words to discount\n :param regex_pattern (str): regex pattern for allowed characters\n :param lowercase (bool): activate case sensitivity by setting it to False\n :param replacements (iterable): list of replacement rules e.g. 
[['|', 'or'], ['%', 'percent']]\n :return (str):\n \"\"\"\n\n # user-specific replacements\n if replacements:\n for old, new in replacements:\n text = text.replace(old, new)\n\n # ensure text is unicode\n if not isinstance(text, _unicode_type):\n text = _unicode(text, 'utf-8', 'ignore')\n\n # replace quotes with dashes - pre-process\n text = QUOTE_PATTERN.sub(DEFAULT_SEPARATOR, text)\n\n # decode unicode\n text = unidecode.unidecode(text)\n\n # ensure text is still in unicode\n if not isinstance(text, _unicode_type):\n text = _unicode(text, 'utf-8', 'ignore')\n\n # character entity reference\n if entities:\n text = CHAR_ENTITY_PATTERN.sub(lambda m: unichr(name2codepoint[m.group(1)]), text)\n\n # decimal character reference\n if decimal:\n try:\n text = DECIMAL_PATTERN.sub(lambda m: unichr(int(m.group(1))), text)\n except Exception:\n pass\n\n # hexadecimal character reference\n if hexadecimal:\n try:\n text = HEX_PATTERN.sub(lambda m: unichr(int(m.group(1), 16)), text)\n except Exception:\n pass\n\n # translate\n text = unicodedata.normalize('NFKD', text)\n\n # make the text lowercase (optional)\n if lowercase:\n text = text.lower()\n\n # remove generated quotes -- post-process\n text = QUOTE_PATTERN.sub('', text)\n\n # cleanup numbers\n text = NUMBERS_PATTERN.sub('', text)\n\n # replace all other unwanted characters\n if lowercase:\n pattern = regex_pattern or ALLOWED_CHARS_PATTERN\n else:\n pattern = regex_pattern or ALLOWED_CHARS_PATTERN_WITH_UPPERCASE\n text = re.sub(pattern, DEFAULT_SEPARATOR, text)\n\n # remove redundant\n text = DUPLICATE_DASH_PATTERN.sub(DEFAULT_SEPARATOR, text).strip(DEFAULT_SEPARATOR)\n\n # remove stopwords\n if stopwords:\n if lowercase:\n stopwords_lower = [s.lower() for s in stopwords]\n words = [w for w in text.split(DEFAULT_SEPARATOR) if w not in stopwords_lower]\n else:\n words = [w for w in text.split(DEFAULT_SEPARATOR) if w not in stopwords]\n text = DEFAULT_SEPARATOR.join(words)\n\n # finalize user-specific replacements\n if 
replacements:\n for old, new in replacements:\n text = text.replace(old, new)\n\n # smart truncate if requested\n if max_length > 0:\n text = smart_truncate(text, max_length, word_boundary, DEFAULT_SEPARATOR, save_order)\n\n if separator != DEFAULT_SEPARATOR:\n text = text.replace(DEFAULT_SEPARATOR, separator)\n\n return text", "response": "Make a slug from the given text."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\nstoring an XCom value in the database.", "response": "def set(\n cls,\n key,\n value,\n execution_date,\n task_id,\n dag_id,\n session=None):\n \"\"\"\n Store an XCom value.\n TODO: \"pickling\" has been deprecated and JSON is preferred.\n \"pickling\" will be removed in Airflow 2.0.\n\n :return: None\n \"\"\"\n session.expunge_all()\n\n enable_pickling = configuration.getboolean('core', 'enable_xcom_pickling')\n if enable_pickling:\n value = pickle.dumps(value)\n else:\n try:\n value = json.dumps(value).encode('UTF-8')\n except ValueError:\n log = LoggingMixin().log\n log.error(\"Could not serialize the XCOM value into JSON. 
\"\n \"If you are using pickles instead of JSON \"\n \"for XCOM, then you need to enable pickle \"\n \"support for XCOM in your airflow config.\")\n raise\n\n # remove any duplicate XComs\n session.query(cls).filter(\n cls.key == key,\n cls.execution_date == execution_date,\n cls.task_id == task_id,\n cls.dag_id == dag_id).delete()\n\n session.commit()\n\n # insert new XCom\n session.add(XCom(\n key=key,\n value=value,\n execution_date=execution_date,\n task_id=task_id,\n dag_id=dag_id))\n\n session.commit()"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\nretrieve an XCom value from the database optionally meeting certain criteria.", "response": "def get_one(cls,\n execution_date,\n key=None,\n task_id=None,\n dag_id=None,\n include_prior_dates=False,\n session=None):\n \"\"\"\n Retrieve an XCom value, optionally meeting certain criteria.\n TODO: \"pickling\" has been deprecated and JSON is preferred.\n \"pickling\" will be removed in Airflow 2.0.\n\n :return: XCom value\n \"\"\"\n filters = []\n if key:\n filters.append(cls.key == key)\n if task_id:\n filters.append(cls.task_id == task_id)\n if dag_id:\n filters.append(cls.dag_id == dag_id)\n if include_prior_dates:\n filters.append(cls.execution_date <= execution_date)\n else:\n filters.append(cls.execution_date == execution_date)\n\n query = (\n session.query(cls.value).filter(and_(*filters))\n .order_by(cls.execution_date.desc(), cls.timestamp.desc()))\n\n result = query.first()\n if result:\n enable_pickling = configuration.getboolean('core', 'enable_xcom_pickling')\n if enable_pickling:\n return pickle.loads(result.value)\n else:\n try:\n return json.loads(result.value.decode('UTF-8'))\n except ValueError:\n log = LoggingMixin().log\n log.error(\"Could not deserialize the XCOM value from JSON. 
\"\n \"If you are using pickles instead of JSON \"\n \"for XCOM, then you need to enable pickle \"\n \"support for XCOM in your airflow config.\")\n raise"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\nretrieving an XCom value optionally meeting certain criteria.", "response": "def get_many(cls,\n execution_date,\n key=None,\n task_ids=None,\n dag_ids=None,\n include_prior_dates=False,\n limit=100,\n session=None):\n \"\"\"\n Retrieve an XCom value, optionally meeting certain criteria\n TODO: \"pickling\" has been deprecated and JSON is preferred.\n \"pickling\" will be removed in Airflow 2.0.\n \"\"\"\n filters = []\n if key:\n filters.append(cls.key == key)\n if task_ids:\n filters.append(cls.task_id.in_(as_tuple(task_ids)))\n if dag_ids:\n filters.append(cls.dag_id.in_(as_tuple(dag_ids)))\n if include_prior_dates:\n filters.append(cls.execution_date <= execution_date)\n else:\n filters.append(cls.execution_date == execution_date)\n\n query = (\n session.query(cls).filter(and_(*filters))\n .order_by(cls.execution_date.desc(), cls.timestamp.desc())\n .limit(limit))\n results = query.all()\n return results"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _convert_date_to_dict(field_date):\n return {DAY: field_date.day, MONTH: field_date.month, YEAR: field_date.year}", "response": "Convert native python datetime. date object to a dict"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef _convert_time_to_dict(time):\n return {HOURS: time.hour, MINUTES: time.minute, SECONDS: time.second}", "response": "Convert native python datetime. 
time object to a dict"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\nreturn a Redis connection.", "response": "def get_conn(self):\n \"\"\"\n Returns a Redis connection.\n \"\"\"\n conn = self.get_connection(self.redis_conn_id)\n self.host = conn.host\n self.port = conn.port\n self.password = None if str(conn.password).lower() in ['none', 'false', ''] else conn.password\n self.db = conn.extra_dejson.get('db', None)\n\n if not self.redis:\n self.log.debug(\n 'Initializing redis object for conn_id \"%s\" on %s:%s:%s',\n self.redis_conn_id, self.host, self.port, self.db\n )\n self.redis = Redis(\n host=self.host,\n port=self.port,\n password=self.password,\n db=self.db)\n\n return self.redis"} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\nreturn a connection object for the current database entry and its class.", "response": "def get_conn(self):\n \"\"\"\n Returns a oracle connection object\n Optional parameters for using a custom DSN connection\n (instead of using a server alias from tnsnames.ora)\n The dsn (data source name) is the TNS entry\n (from the Oracle names server or tnsnames.ora file)\n or is a string like the one returned from makedsn().\n\n :param dsn: the host address for the Oracle server\n :param service_name: the db_unique_name of the database\n that you are connecting to (CONNECT_DATA part of TNS)\n\n You can set these parameters in the extra fields of your connection\n as in ``{ \"dsn\":\"some.host.address\" , \"service_name\":\"some.service.name\" }``\n see more param detail in\n `cx_Oracle.connect `_\n \"\"\"\n conn = self.get_connection(self.oracle_conn_id)\n conn_config = {\n 'user': conn.login,\n 'password': conn.password\n }\n dsn = conn.extra_dejson.get('dsn', None)\n sid = conn.extra_dejson.get('sid', None)\n mod = conn.extra_dejson.get('module', None)\n\n service_name = conn.extra_dejson.get('service_name', None)\n port = conn.port if conn.port else 1521\n if dsn and sid and 
not service_name:\n conn_config['dsn'] = cx_Oracle.makedsn(dsn, port, sid)\n elif dsn and service_name and not sid:\n conn_config['dsn'] = cx_Oracle.makedsn(dsn, port, service_name=service_name)\n else:\n conn_config['dsn'] = conn.host\n\n if 'encoding' in conn.extra_dejson:\n conn_config['encoding'] = conn.extra_dejson.get('encoding')\n # if `encoding` is specific but `nencoding` is not\n # `nencoding` should use same values as `encoding` to set encoding, inspired by\n # https://github.com/oracle/python-cx_Oracle/issues/157#issuecomment-371877993\n if 'nencoding' not in conn.extra_dejson:\n conn_config['nencoding'] = conn.extra_dejson.get('encoding')\n if 'nencoding' in conn.extra_dejson:\n conn_config['nencoding'] = conn.extra_dejson.get('nencoding')\n if 'threaded' in conn.extra_dejson:\n conn_config['threaded'] = conn.extra_dejson.get('threaded')\n if 'events' in conn.extra_dejson:\n conn_config['events'] = conn.extra_dejson.get('events')\n\n mode = conn.extra_dejson.get('mode', '').lower()\n if mode == 'sysdba':\n conn_config['mode'] = cx_Oracle.SYSDBA\n elif mode == 'sysasm':\n conn_config['mode'] = cx_Oracle.SYSASM\n elif mode == 'sysoper':\n conn_config['mode'] = cx_Oracle.SYSOPER\n elif mode == 'sysbkp':\n conn_config['mode'] = cx_Oracle.SYSBKP\n elif mode == 'sysdgd':\n conn_config['mode'] = cx_Oracle.SYSDGD\n elif mode == 'syskmt':\n conn_config['mode'] = cx_Oracle.SYSKMT\n elif mode == 'sysrac':\n conn_config['mode'] = cx_Oracle.SYSRAC\n\n purity = conn.extra_dejson.get('purity', '').lower()\n if purity == 'new':\n conn_config['purity'] = cx_Oracle.ATTR_PURITY_NEW\n elif purity == 'self':\n conn_config['purity'] = cx_Oracle.ATTR_PURITY_SELF\n elif purity == 'default':\n conn_config['purity'] = cx_Oracle.ATTR_PURITY_DEFAULT\n\n conn = cx_Oracle.connect(**conn_config)\n if mod is not None:\n conn.module = mod\n\n return conn"} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\ninsert a set of rows into a table in 
Oracle.", "response": "def insert_rows(self, table, rows, target_fields=None, commit_every=1000):\n \"\"\"\n A generic way to insert a set of tuples into a table,\n the whole set of inserts is treated as one transaction\n Changes from standard DbApiHook implementation:\n\n - Oracle SQL queries in cx_Oracle can not be terminated with a semicolon (`;`)\n - Replace NaN values with NULL using `numpy.nan_to_num` (not using\n `is_nan()` because of input types error for strings)\n - Coerce datetime cells to Oracle DATETIME format during insert\n\n :param table: target Oracle table, use dot notation to target a\n specific database\n :type table: str\n :param rows: the rows to insert into the table\n :type rows: iterable of tuples\n :param target_fields: the names of the columns to fill in the table\n :type target_fields: iterable of str\n :param commit_every: the maximum number of rows to insert in one transaction\n Default 1000, Set greater than 0.\n Set 1 to insert each row in each single transaction\n :type commit_every: int\n \"\"\"\n if target_fields:\n target_fields = ', '.join(target_fields)\n target_fields = '({})'.format(target_fields)\n else:\n target_fields = ''\n conn = self.get_conn()\n cur = conn.cursor()\n if self.supports_autocommit:\n cur.execute('SET autocommit = 0')\n conn.commit()\n i = 0\n for row in rows:\n i += 1\n lst = []\n for cell in row:\n if isinstance(cell, basestring):\n lst.append(\"'\" + str(cell).replace(\"'\", \"''\") + \"'\")\n elif cell is None:\n lst.append('NULL')\n elif type(cell) == float and \\\n numpy.isnan(cell): # coerce numpy NaN to NULL\n lst.append('NULL')\n elif isinstance(cell, numpy.datetime64):\n lst.append(\"'\" + str(cell) + \"'\")\n elif isinstance(cell, datetime):\n lst.append(\"to_date('\" +\n cell.strftime('%Y-%m-%d %H:%M:%S') +\n \"','YYYY-MM-DD HH24:MI:SS')\")\n else:\n lst.append(str(cell))\n values = tuple(lst)\n sql = 'INSERT /*+ APPEND */ ' \\\n 'INTO {0} {1} VALUES ({2})'.format(table,\n target_fields,\n 
','.join(values))\n cur.execute(sql)\n if i % commit_every == 0:\n conn.commit()\n self.log.info('Loaded %s into %s rows so far', i, table)\n conn.commit()\n cur.close()\n conn.close()\n self.log.info('Done loading. Loaded a total of %s rows', i)"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\nreturn a connection object for the current object.", "response": "def get_conn(self):\n \"\"\"Returns a connection object\n \"\"\"\n db = self.get_connection(getattr(self, self.conn_name_attr))\n return self.connector.connect(\n host=db.host,\n port=db.port,\n username=db.login,\n schema=db.schema)"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\nexecuting the sql and returns a pandas dataframe", "response": "def get_pandas_df(self, sql, parameters=None):\n \"\"\"\n Executes the sql and returns a pandas dataframe\n\n :param sql: the sql statement to be executed (str) or a list of\n sql statements to execute\n :type sql: str or list\n :param parameters: The parameters to render the SQL query with.\n :type parameters: mapping or iterable\n \"\"\"\n import pandas.io.sql as psql\n\n with closing(self.get_conn()) as conn:\n return psql.read_sql(sql, con=conn, params=parameters)"} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\nexecute the sql and returns a set of records.", "response": "def get_records(self, sql, parameters=None):\n \"\"\"\n Executes the sql and returns a set of records.\n\n :param sql: the sql statement to be executed (str) or a list of\n sql statements to execute\n :type sql: str or list\n :param parameters: The parameters to render the SQL query with.\n :type parameters: mapping or iterable\n \"\"\"\n with closing(self.get_conn()) as conn:\n with closing(conn.cursor()) as cur:\n if parameters is not None:\n cur.execute(sql, parameters)\n else:\n cur.execute(sql)\n return cur.fetchall()"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 
code\ndef get_first(self, sql, parameters=None):\n with closing(self.get_conn()) as conn:\n with closing(conn.cursor()) as cur:\n if parameters is not None:\n cur.execute(sql, parameters)\n else:\n cur.execute(sql)\n return cur.fetchone()", "response": "Executes the sql and returns the first result."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\nexecute a SQL query or a list of SQL statements.", "response": "def run(self, sql, autocommit=False, parameters=None):\n \"\"\"\n Runs a command or a list of commands. Pass a list of sql\n statements to the sql parameter to get them to execute\n sequentially\n\n :param sql: the sql statement to be executed (str) or a list of\n sql statements to execute\n :type sql: str or list\n :param autocommit: What to set the connection's autocommit setting to\n before executing the query.\n :type autocommit: bool\n :param parameters: The parameters to render the SQL query with.\n :type parameters: mapping or iterable\n \"\"\"\n if isinstance(sql, basestring):\n sql = [sql]\n\n with closing(self.get_conn()) as conn:\n if self.supports_autocommit:\n self.set_autocommit(conn, autocommit)\n\n with closing(conn.cursor()) as cur:\n for s in sql:\n if parameters is not None:\n self.log.info(\"{} with parameters {}\".format(s, parameters))\n cur.execute(s, parameters)\n else:\n self.log.info(s)\n cur.execute(s)\n\n # If autocommit was set to False for db that supports autocommit,\n # or if db does not supports autocommit, we do a manual commit.\n if not self.get_autocommit(conn):\n conn.commit()"} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\nsets the autocommit flag on the connection conn.", "response": "def set_autocommit(self, conn, autocommit):\n \"\"\"\n Sets the autocommit flag on the connection\n \"\"\"\n if not self.supports_autocommit and autocommit:\n self.log.warn(\n (\"%s connection doesn't support \"\n \"autocommit but autocommit activated.\"),\n 
getattr(self, self.conn_name_attr))\n conn.autocommit = autocommit"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef insert_rows(self, table, rows, target_fields=None, commit_every=1000,\n replace=False):\n \"\"\"\n A generic way to insert a set of tuples into a table,\n a new transaction is created every commit_every rows\n\n :param table: Name of the target table\n :type table: str\n :param rows: The rows to insert into the table\n :type rows: iterable of tuples\n :param target_fields: The names of the columns to fill in the table\n :type target_fields: iterable of strings\n :param commit_every: The maximum number of rows to insert in one\n transaction. Set to 0 to insert all rows in one transaction.\n :type commit_every: int\n :param replace: Whether to replace instead of insert\n :type replace: bool\n \"\"\"\n if target_fields:\n target_fields = \", \".join(target_fields)\n target_fields = \"({})\".format(target_fields)\n else:\n target_fields = ''\n i = 0\n with closing(self.get_conn()) as conn:\n if self.supports_autocommit:\n self.set_autocommit(conn, False)\n\n conn.commit()\n\n with closing(conn.cursor()) as cur:\n for i, row in enumerate(rows, 1):\n lst = []\n for cell in row:\n lst.append(self._serialize_cell(cell, conn))\n values = tuple(lst)\n placeholders = [\"%s\", ] * len(values)\n if not replace:\n sql = \"INSERT INTO \"\n else:\n sql = \"REPLACE INTO \"\n sql += \"{0} {1} VALUES ({2})\".format(\n table,\n target_fields,\n \",\".join(placeholders))\n cur.execute(sql, values)\n if commit_every and i % commit_every == 0:\n conn.commit()\n self.log.info(\n \"Loaded %s into %s rows so far\", i, table\n )\n\n conn.commit()\n self.log.info(\"Done loading. 
Loaded a total of %s rows\", i)", "response": "A generic way to insert a set of tuples into a table."} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\nreturns the SQL literal of the cell as a string.", "response": "def _serialize_cell(cell, conn=None):\n \"\"\"\n Returns the SQL literal of the cell as a string.\n\n :param cell: The cell to insert into the table\n :type cell: object\n :param conn: The database connection\n :type conn: connection object\n :return: The serialized cell\n :rtype: str\n \"\"\"\n\n if cell is None:\n return None\n if isinstance(cell, datetime):\n return cell.isoformat()\n return str(cell)"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef health(self, session=None):\n\n BJ = jobs.BaseJob\n payload = {}\n scheduler_health_check_threshold = timedelta(seconds=conf.getint('scheduler',\n 'scheduler_health_check_threshold'\n ))\n\n latest_scheduler_heartbeat = None\n payload['metadatabase'] = {'status': 'healthy'}\n try:\n latest_scheduler_heartbeat = session.query(func.max(BJ.latest_heartbeat)).\\\n filter(BJ.state == 'running', BJ.job_type == 'SchedulerJob').\\\n scalar()\n except Exception:\n payload['metadatabase']['status'] = 'unhealthy'\n\n if not latest_scheduler_heartbeat:\n scheduler_status = 'unhealthy'\n else:\n if timezone.utcnow() - latest_scheduler_heartbeat <= scheduler_health_check_threshold:\n scheduler_status = 'healthy'\n else:\n scheduler_status = 'unhealthy'\n\n payload['scheduler'] = {'status': scheduler_status,\n 'latest_scheduler_heartbeat': str(latest_scheduler_heartbeat)}\n\n return wwwutils.json_response(payload)", "response": "This endpoint returns the health status of the Airflow instance."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\nreturning a QuerySet of all the related objects in the database.", "response": "def get_query(self):\n \"\"\"\n Default filters for model\n 
\"\"\"\n return (\n super().get_query()\n .filter(or_(models.DagModel.is_active,\n models.DagModel.is_paused))\n .filter(~models.DagModel.is_subdag)\n )"} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef get_count_query(self):\n return (\n super().get_count_query()\n .filter(models.DagModel.is_active)\n .filter(~models.DagModel.is_subdag)\n )", "response": "Returns a QuerySet that returns the count of the active Dags."} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\nreturns a connection to the cloudant service and opens it automatically if used as context manager.", "response": "def get_conn(self):\n \"\"\"\n Opens a connection to the cloudant service and closes it automatically if used as context manager.\n\n .. note::\n In the connection form:\n - 'host' equals the 'Account' (optional)\n - 'login' equals the 'Username (or API Key)' (required)\n - 'password' equals the 'Password' (required)\n\n :return: an authorized cloudant session context manager object.\n :rtype: cloudant\n \"\"\"\n conn = self.get_connection(self.cloudant_conn_id)\n\n self._validate_connection(conn)\n\n cloudant_session = cloudant(user=conn.login, passwd=conn.password, account=conn.host)\n\n return cloudant_session"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef execute(self, context):\n self.hook = SlackWebhookHook(\n self.http_conn_id,\n self.webhook_token,\n self.message,\n self.attachments,\n self.channel,\n self.username,\n self.icon_emoji,\n self.link_names,\n self.proxy\n )\n self.hook.execute()", "response": "Call the SlackWebhookHook to post the provided Slack message"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef _get_credentials(self):\n key_path = self._get_field('key_path', False)\n keyfile_dict = self._get_field('keyfile_dict', False)\n scope = 
self._get_field('scope', None)\n if scope:\n scopes = [s.strip() for s in scope.split(',')]\n else:\n scopes = _DEFAULT_SCOPES\n\n if not key_path and not keyfile_dict:\n self.log.info('Getting connection using `google.auth.default()` '\n 'since no key file is defined for hook.')\n credentials, _ = google.auth.default(scopes=scopes)\n elif key_path:\n # Get credentials from a JSON file.\n if key_path.endswith('.json'):\n self.log.debug('Getting connection using JSON key file %s' % key_path)\n credentials = (\n google.oauth2.service_account.Credentials.from_service_account_file(\n key_path, scopes=scopes)\n )\n elif key_path.endswith('.p12'):\n raise AirflowException('Legacy P12 key file are not supported, '\n 'use a JSON key file.')\n else:\n raise AirflowException('Unrecognised extension for key file.')\n else:\n # Get credentials from JSON data provided in the UI.\n try:\n keyfile_dict = json.loads(keyfile_dict)\n\n # Depending on how the JSON was formatted, it may contain\n # escaped newlines. 
Convert those to actual newlines.\n keyfile_dict['private_key'] = keyfile_dict['private_key'].replace(\n '\\\\n', '\\n')\n\n credentials = (\n google.oauth2.service_account.Credentials.from_service_account_info(\n keyfile_dict, scopes=scopes)\n )\n except json.decoder.JSONDecodeError:\n raise AirflowException('Invalid key JSON.')\n\n return credentials.with_subject(self.delegate_to) \\\n if self.delegate_to else credentials", "response": "Returns the Credentials object for Google API."} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\nreturns an authorized HTTP object to be used to build a Google Cloud service hook connection.", "response": "def _authorize(self):\n \"\"\"\n Returns an authorized HTTP object to be used to build a Google cloud\n service hook connection.\n \"\"\"\n credentials = self._get_credentials()\n http = httplib2.Http()\n authed_http = google_auth_httplib2.AuthorizedHttp(\n credentials, http=http)\n return authed_http"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef _get_field(self, f, default=None):\n long_f = 'extra__google_cloud_platform__{}'.format(f)\n if hasattr(self, 'extras') and long_f in self.extras:\n return self.extras[long_f]\n else:\n return default", "response": "Gets a field from extras and returns it."} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\nfunction decorator that intercepts HTTP Errors and raises AirflowException with more informative message.", "response": "def catch_http_exception(func):\n \"\"\"\n Function decorator that intercepts HTTP Errors and raises AirflowException\n with more informative message.\n \"\"\"\n\n @functools.wraps(func)\n def wrapper_decorator(self, *args, **kwargs):\n try:\n return func(self, *args, **kwargs)\n except GoogleAPICallError as e:\n if isinstance(e, AlreadyExists):\n raise e\n else:\n self.log.error('The request failed:\\n%s', str(e))\n raise 
AirflowException(e)\n except RetryError as e:\n self.log.error('The request failed due to a retryable error and retry attempts failed.')\n raise AirflowException(e)\n except ValueError as e:\n self.log.error('The request failed, the parameters are invalid.')\n raise AirflowException(e)\n except HttpError as e:\n self.log.error('The request failed:\\n%s', str(e))\n raise AirflowException(e)\n\n return wrapper_decorator"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef unfinished(cls):\n return [\n cls.NONE,\n cls.SCHEDULED,\n cls.QUEUED,\n cls.RUNNING,\n cls.SHUTDOWN,\n cls.UP_FOR_RETRY,\n cls.UP_FOR_RESCHEDULE\n ]", "response": "A list of states indicating that a task has not yet been completed."} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef delete_dag(dag_id, keep_records_in_log=True, session=None):\n DM = models.DagModel\n dag = session.query(DM).filter(DM.dag_id == dag_id).first()\n if dag is None:\n raise DagNotFound(\"Dag id {} not found\".format(dag_id))\n\n if dag.fileloc and os.path.exists(dag.fileloc):\n raise DagFileExists(\"Dag id {} is still in DagBag. 
\"\n \"Remove the DAG file first: {}\".format(dag_id, dag.fileloc))\n\n count = 0\n\n # noinspection PyUnresolvedReferences,PyProtectedMember\n for m in models.base.Base._decl_class_registry.values():\n if hasattr(m, \"dag_id\"):\n if keep_records_in_log and m.__name__ == 'Log':\n continue\n cond = or_(m.dag_id == dag_id, m.dag_id.like(dag_id + \".%\"))\n count += session.query(m).filter(cond).delete(synchronize_session='fetch')\n\n if dag.is_subdag:\n p, c = dag_id.rsplit(\".\", 1)\n for m in models.DagRun, TaskFail, models.TaskInstance:\n count += session.query(m).filter(m.dag_id == p, m.task_id == c).delete()\n\n return count", "response": "Delete a DAG from the database."} {"SOURCE": "codesearchnet", "instruction": "Can you create a Python 3 function that\nconstructs the spark - sql command to execute.", "response": "def _prepare_command(self, cmd):\n \"\"\"\n Construct the spark-sql command to execute. Verbose output is enabled\n as default.\n\n :param cmd: command to append to the spark-sql command\n :type cmd: str\n :return: full command to be executed\n \"\"\"\n connection_cmd = [\"spark-sql\"]\n if self._conf:\n for conf_el in self._conf.split(\",\"):\n connection_cmd += [\"--conf\", conf_el]\n if self._total_executor_cores:\n connection_cmd += [\"--total-executor-cores\", str(self._total_executor_cores)]\n if self._executor_cores:\n connection_cmd += [\"--executor-cores\", str(self._executor_cores)]\n if self._executor_memory:\n connection_cmd += [\"--executor-memory\", self._executor_memory]\n if self._keytab:\n connection_cmd += [\"--keytab\", self._keytab]\n if self._principal:\n connection_cmd += [\"--principal\", self._principal]\n if self._num_executors:\n connection_cmd += [\"--num-executors\", str(self._num_executors)]\n if self._sql:\n sql = self._sql.strip()\n if sql.endswith(\".sql\") or sql.endswith(\".hql\"):\n connection_cmd += [\"-f\", sql]\n else:\n connection_cmd += [\"-e\", sql]\n if self._master:\n connection_cmd += [\"--master\", 
self._master]\n if self._name:\n connection_cmd += [\"--name\", self._name]\n if self._verbose:\n connection_cmd += [\"--verbose\"]\n if self._yarn_queue:\n connection_cmd += [\"--queue\", self._yarn_queue]\n\n connection_cmd += cmd\n self.log.debug(\"Spark-Sql cmd: %s\", connection_cmd)\n\n return connection_cmd"} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef run_query(self, cmd=\"\", **kwargs):\n spark_sql_cmd = self._prepare_command(cmd)\n self._sp = subprocess.Popen(spark_sql_cmd,\n stdout=subprocess.PIPE,\n stderr=subprocess.STDOUT,\n **kwargs)\n\n for line in iter(self._sp.stdout.readline, ''):\n self.log.info(line)\n\n returncode = self._sp.wait()\n\n if returncode:\n raise AirflowException(\n \"Cannot execute {} on {}. Process exit code: {}.\".format(\n cmd, self._conn.host, returncode\n )\n )", "response": "Execute a Spark SQL query, streaming its output to the log and raising on a non-zero exit code."} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef densenet121(pretrained=False, **kwargs):\n model = DenseNet(num_init_features=64, growth_rate=32, block_config=(6, 12, 24, 16),\n **kwargs)\n if pretrained:\n _load_state_dict(model, model_urls['densenet121'])\n return model", "response": "Densenet-121 model, optionally loading weights pretrained on ImageNet."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef to_tensor(pic):\n if not(_is_pil_image(pic) or _is_numpy_image(pic)):\n raise TypeError('pic should be PIL Image or ndarray. 
Got {}'.format(type(pic)))\n\n if isinstance(pic, np.ndarray):\n # handle numpy array\n if pic.ndim == 2:\n pic = pic[:, :, None]\n\n img = torch.from_numpy(pic.transpose((2, 0, 1)))\n # backward compatibility\n if isinstance(img, torch.ByteTensor):\n return img.float().div(255)\n else:\n return img\n\n if accimage is not None and isinstance(pic, accimage.Image):\n nppic = np.zeros([pic.channels, pic.height, pic.width], dtype=np.float32)\n pic.copyto(nppic)\n return torch.from_numpy(nppic)\n\n # handle PIL Image\n if pic.mode == 'I':\n img = torch.from_numpy(np.array(pic, np.int32, copy=False))\n elif pic.mode == 'I;16':\n img = torch.from_numpy(np.array(pic, np.int16, copy=False))\n elif pic.mode == 'F':\n img = torch.from_numpy(np.array(pic, np.float32, copy=False))\n elif pic.mode == '1':\n img = 255 * torch.from_numpy(np.array(pic, np.uint8, copy=False))\n else:\n img = torch.ByteTensor(torch.ByteStorage.from_buffer(pic.tobytes()))\n # PIL image mode: L, LA, P, I, F, RGB, YCbCr, RGBA, CMYK\n if pic.mode == 'YCbCr':\n nchannel = 3\n elif pic.mode == 'I;16':\n nchannel = 1\n else:\n nchannel = len(pic.mode)\n img = img.view(pic.size[1], pic.size[0], nchannel)\n # put it from HWC to CHW format\n # yikes, this transpose takes 80% of the loading time/CPU\n img = img.transpose(0, 1).transpose(0, 2).contiguous()\n if isinstance(img, torch.ByteTensor):\n return img.float().div(255)\n else:\n return img", "response": "Convert a PIL Image or numpy. ndarray to a tensor."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\nconverting a tensor or numpy array to PIL Image.", "response": "def to_pil_image(pic, mode=None):\n \"\"\"Convert a tensor or an ndarray to PIL Image.\n\n See :class:`~torchvision.transforms.ToPILImage` for more details.\n\n Args:\n pic (Tensor or numpy.ndarray): Image to be converted to PIL Image.\n mode (`PIL.Image mode`_): color space and pixel depth of input data (optional).\n\n .. 
_PIL.Image mode: https://pillow.readthedocs.io/en/latest/handbook/concepts.html#concept-modes\n\n Returns:\n PIL Image: Image converted to PIL Image.\n \"\"\"\n if not(isinstance(pic, torch.Tensor) or isinstance(pic, np.ndarray)):\n raise TypeError('pic should be Tensor or ndarray. Got {}.'.format(type(pic)))\n\n elif isinstance(pic, torch.Tensor):\n if pic.ndimension() not in {2, 3}:\n raise ValueError('pic should be 2/3 dimensional. Got {} dimensions.'.format(pic.ndimension()))\n\n elif pic.ndimension() == 2:\n # if 2D image, add channel dimension (CHW)\n pic = pic.unsqueeze(0)\n\n elif isinstance(pic, np.ndarray):\n if pic.ndim not in {2, 3}:\n raise ValueError('pic should be 2/3 dimensional. Got {} dimensions.'.format(pic.ndim))\n\n elif pic.ndim == 2:\n # if 2D image, add channel dimension (HWC)\n pic = np.expand_dims(pic, 2)\n\n npimg = pic\n if isinstance(pic, torch.FloatTensor):\n pic = pic.mul(255).byte()\n if isinstance(pic, torch.Tensor):\n npimg = np.transpose(pic.numpy(), (1, 2, 0))\n\n if not isinstance(npimg, np.ndarray):\n raise TypeError('Input pic must be a torch.Tensor or NumPy ndarray, ' +\n 'not {}'.format(type(npimg)))\n\n if npimg.shape[2] == 1:\n expected_mode = None\n npimg = npimg[:, :, 0]\n if npimg.dtype == np.uint8:\n expected_mode = 'L'\n elif npimg.dtype == np.int16:\n expected_mode = 'I;16'\n elif npimg.dtype == np.int32:\n expected_mode = 'I'\n elif npimg.dtype == np.float32:\n expected_mode = 'F'\n if mode is not None and mode != expected_mode:\n raise ValueError(\"Incorrect mode ({}) supplied for input type {}. 
Should be {}\"\n .format(mode, np.dtype, expected_mode))\n mode = expected_mode\n\n elif npimg.shape[2] == 2:\n permitted_2_channel_modes = ['LA']\n if mode is not None and mode not in permitted_2_channel_modes:\n raise ValueError(\"Only modes {} are supported for 2D inputs\".format(permitted_2_channel_modes))\n\n if mode is None and npimg.dtype == np.uint8:\n mode = 'LA'\n\n elif npimg.shape[2] == 4:\n permitted_4_channel_modes = ['RGBA', 'CMYK', 'RGBX']\n if mode is not None and mode not in permitted_4_channel_modes:\n raise ValueError(\"Only modes {} are supported for 4D inputs\".format(permitted_4_channel_modes))\n\n if mode is None and npimg.dtype == np.uint8:\n mode = 'RGBA'\n else:\n permitted_3_channel_modes = ['RGB', 'YCbCr', 'HSV']\n if mode is not None and mode not in permitted_3_channel_modes:\n raise ValueError(\"Only modes {} are supported for 3D inputs\".format(permitted_3_channel_modes))\n if mode is None and npimg.dtype == np.uint8:\n mode = 'RGB'\n\n if mode is None:\n raise TypeError('Input type {} is not supported'.format(npimg.dtype))\n\n return Image.fromarray(npimg, mode=mode)"} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\nnormalizes a tensor image with mean and standard deviation.", "response": "def normalize(tensor, mean, std, inplace=False):\n \"\"\"Normalize a tensor image with mean and standard deviation.\n\n .. 
note::\n This transform acts out of place by default, i.e., it does not mutate the input tensor.\n\n See :class:`~torchvision.transforms.Normalize` for more details.\n\n Args:\n tensor (Tensor): Tensor image of size (C, H, W) to be normalized.\n mean (sequence): Sequence of means for each channel.\n std (sequence): Sequence of standard deviations for each channel.\n\n Returns:\n Tensor: Normalized Tensor image.\n \"\"\"\n if not _is_tensor_image(tensor):\n raise TypeError('tensor is not a torch image.')\n\n if not inplace:\n tensor = tensor.clone()\n\n mean = torch.as_tensor(mean, dtype=torch.float32, device=tensor.device)\n std = torch.as_tensor(std, dtype=torch.float32, device=tensor.device)\n tensor.sub_(mean[:, None, None]).div_(std[:, None, None])\n return tensor"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef pad(img, padding, fill=0, padding_mode='constant'):\n if not _is_pil_image(img):\n raise TypeError('img should be PIL Image. 
Got {}'.format(type(img)))\n\n if not isinstance(padding, (numbers.Number, tuple)):\n raise TypeError('Got inappropriate padding arg')\n if not isinstance(fill, (numbers.Number, str, tuple)):\n raise TypeError('Got inappropriate fill arg')\n if not isinstance(padding_mode, str):\n raise TypeError('Got inappropriate padding_mode arg')\n\n if isinstance(padding, Sequence) and len(padding) not in [2, 4]:\n raise ValueError(\"Padding must be an int or a 2, or 4 element tuple, not a \" +\n \"{} element tuple\".format(len(padding)))\n\n assert padding_mode in ['constant', 'edge', 'reflect', 'symmetric'], \\\n 'Padding mode should be either constant, edge, reflect or symmetric'\n\n if padding_mode == 'constant':\n if img.mode == 'P':\n palette = img.getpalette()\n image = ImageOps.expand(img, border=padding, fill=fill)\n image.putpalette(palette)\n return image\n\n return ImageOps.expand(img, border=padding, fill=fill)\n else:\n if isinstance(padding, int):\n pad_left = pad_right = pad_top = pad_bottom = padding\n if isinstance(padding, Sequence) and len(padding) == 2:\n pad_left = pad_right = padding[0]\n pad_top = pad_bottom = padding[1]\n if isinstance(padding, Sequence) and len(padding) == 4:\n pad_left = padding[0]\n pad_top = padding[1]\n pad_right = padding[2]\n pad_bottom = padding[3]\n\n if img.mode == 'P':\n palette = img.getpalette()\n img = np.asarray(img)\n img = np.pad(img, ((pad_top, pad_bottom), (pad_left, pad_right)), padding_mode)\n img = Image.fromarray(img)\n img.putpalette(palette)\n return img\n\n img = np.asarray(img)\n # RGB image\n if len(img.shape) == 3:\n img = np.pad(img, ((pad_top, pad_bottom), (pad_left, pad_right), (0, 0)), padding_mode)\n # Grayscale image\n if len(img.shape) == 2:\n img = np.pad(img, ((pad_top, pad_bottom), (pad_left, pad_right)), padding_mode)\n\n return Image.fromarray(img)", "response": "r Pads the given PIL Image on all sides with specified padding mode and fill value."} {"SOURCE": "codesearchnet", "instruction": 
"Explain what the following Python 3 code does\ndef crop(img, i, j, h, w):\n if not _is_pil_image(img):\n raise TypeError('img should be PIL Image. Got {}'.format(type(img)))\n\n return img.crop((j, i, j + w, i + h))", "response": "Crop the given PIL Image."} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\ncrop the given PIL Image and resize it to desired size.", "response": "def resized_crop(img, i, j, h, w, size, interpolation=Image.BILINEAR):\n \"\"\"Crop the given PIL Image and resize it to desired size.\n\n Notably used in :class:`~torchvision.transforms.RandomResizedCrop`.\n\n Args:\n img (PIL Image): Image to be cropped.\n i (int): i in (i,j) i.e coordinates of the upper left corner\n j (int): j in (i,j) i.e coordinates of the upper left corner\n h (int): Height of the cropped image.\n w (int): Width of the cropped image.\n size (sequence or int): Desired output size. Same semantics as ``resize``.\n interpolation (int, optional): Desired interpolation. 
Default is\n ``PIL.Image.BILINEAR``.\n Returns:\n PIL Image: Cropped image.\n \"\"\"\n assert _is_pil_image(img), 'img should be PIL Image'\n img = crop(img, i, j, h, w)\n img = resize(img, size, interpolation)\n return img"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef _get_perspective_coeffs(startpoints, endpoints):\n matrix = []\n\n for p1, p2 in zip(endpoints, startpoints):\n matrix.append([p1[0], p1[1], 1, 0, 0, 0, -p2[0] * p1[0], -p2[0] * p1[1]])\n matrix.append([0, 0, 0, p1[0], p1[1], 1, -p2[1] * p1[0], -p2[1] * p1[1]])\n\n A = torch.tensor(matrix, dtype=torch.float)\n B = torch.tensor(startpoints, dtype=torch.float).view(8)\n res = torch.gels(B, A)[0]\n return res.squeeze_(1).tolist()", "response": "Helper function to get the coefficients for the perspective transform."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\nperform a perspective transform of the given PIL Image.", "response": "def perspective(img, startpoints, endpoints, interpolation=Image.BICUBIC):\n \"\"\"Perform perspective transform of the given PIL Image.\n\n Args:\n img (PIL Image): Image to be transformed.\n coeffs (tuple) : 8-tuple (a, b, c, d, e, f, g, h) which contains the coefficients.\n for a perspective transform.\n interpolation: Default- Image.BICUBIC\n Returns:\n PIL Image: Perspectively transformed Image.\n \"\"\"\n if not _is_pil_image(img):\n raise TypeError('img should be PIL Image. 
Got {}'.format(type(img)))\n\n coeffs = _get_perspective_coeffs(startpoints, endpoints)\n return img.transform(img.size, Image.PERSPECTIVE, coeffs, interpolation)"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef five_crop(img, size):\n if isinstance(size, numbers.Number):\n size = (int(size), int(size))\n else:\n assert len(size) == 2, \"Please provide only two dimensions (h, w) for size.\"\n\n w, h = img.size\n crop_h, crop_w = size\n if crop_w > w or crop_h > h:\n raise ValueError(\"Requested crop size {} is bigger than input size {}\".format(size,\n (h, w)))\n tl = img.crop((0, 0, crop_w, crop_h))\n tr = img.crop((w - crop_w, 0, w, crop_h))\n bl = img.crop((0, h - crop_h, crop_w, h))\n br = img.crop((w - crop_w, h - crop_h, w, h))\n center = center_crop(img, (crop_h, crop_w))\n return (tl, tr, bl, br, center)", "response": "Crop the given PIL Image into four corners and the central crop."} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef ten_crop(img, size, vertical_flip=False):\n if isinstance(size, numbers.Number):\n size = (int(size), int(size))\n else:\n assert len(size) == 2, \"Please provide only two dimensions (h, w) for size.\"\n\n first_five = five_crop(img, size)\n\n if vertical_flip:\n img = vflip(img)\n else:\n img = hflip(img)\n\n second_five = five_crop(img, size)\n return first_five + second_five", "response": "r Returns a tuple of images that are ten corners and the central crop plus the a\n flipped version of these."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\nadjust the brightness of an Image.", "response": "def adjust_brightness(img, brightness_factor):\n \"\"\"Adjust brightness of an Image.\n\n Args:\n img (PIL Image): PIL Image to be adjusted.\n brightness_factor (float): How much to adjust the brightness. Can be\n any non negative number. 
0 gives a black image, 1 gives the\n original image while 2 increases the brightness by a factor of 2.\n\n Returns:\n PIL Image: Brightness adjusted image.\n \"\"\"\n if not _is_pil_image(img):\n raise TypeError('img should be PIL Image. Got {}'.format(type(img)))\n\n enhancer = ImageEnhance.Brightness(img)\n img = enhancer.enhance(brightness_factor)\n return img"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef adjust_contrast(img, contrast_factor):\n if not _is_pil_image(img):\n raise TypeError('img should be PIL Image. Got {}'.format(type(img)))\n\n enhancer = ImageEnhance.Contrast(img)\n img = enhancer.enhance(contrast_factor)\n return img", "response": "Adjusts the contrast of an Image."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\nadjust the saturation of an image.", "response": "def adjust_saturation(img, saturation_factor):\n \"\"\"Adjust color saturation of an image.\n\n Args:\n img (PIL Image): PIL Image to be adjusted.\n saturation_factor (float): How much to adjust the saturation. 0 will\n give a black and white image, 1 will give the original image while\n 2 will enhance the saturation by a factor of 2.\n\n Returns:\n PIL Image: Saturation adjusted image.\n \"\"\"\n if not _is_pil_image(img):\n raise TypeError('img should be PIL Image. Got {}'.format(type(img)))\n\n enhancer = ImageEnhance.Color(img)\n img = enhancer.enhance(saturation_factor)\n return img"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef adjust_hue(img, hue_factor):\n if not(-0.5 <= hue_factor <= 0.5):\n raise ValueError('hue_factor ({}) is not in [-0.5, 0.5].'.format(hue_factor))\n\n if not _is_pil_image(img):\n raise TypeError('img should be PIL Image. 
Got {}'.format(type(img)))\n\n input_mode = img.mode\n if input_mode in {'L', '1', 'I', 'F'}:\n return img\n\n h, s, v = img.convert('HSV').split()\n\n np_h = np.array(h, dtype=np.uint8)\n # uint8 addition take cares of rotation across boundaries\n with np.errstate(over='ignore'):\n np_h += np.uint8(hue_factor * 255)\n h = Image.fromarray(np_h, 'L')\n\n img = Image.merge('HSV', (h, s, v)).convert(input_mode)\n return img", "response": "Adjusts the hue of an image."} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef rotate(img, angle, resample=False, expand=False, center=None):\n\n if not _is_pil_image(img):\n raise TypeError('img should be PIL Image. Got {}'.format(type(img)))\n\n return img.rotate(angle, resample, expand, center)", "response": "Rotate the image by angle."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\napplying an affine transformation on the image keeping image center invariant.", "response": "def affine(img, angle, translate, scale, shear, resample=0, fillcolor=None):\n \"\"\"Apply affine transformation on the image keeping image center invariant\n\n Args:\n img (PIL Image): PIL Image to be rotated.\n angle (float or int): rotation angle in degrees between -180 and 180, clockwise direction.\n translate (list or tuple of integers): horizontal and vertical translations (post-rotation translation)\n scale (float): overall scale\n shear (float): shear angle value in degrees between -180 to 180, clockwise direction.\n resample (``PIL.Image.NEAREST`` or ``PIL.Image.BILINEAR`` or ``PIL.Image.BICUBIC``, optional):\n An optional resampling filter.\n See `filters`_ for more information.\n If omitted, or if the image has mode \"1\" or \"P\", it is set to ``PIL.Image.NEAREST``.\n fillcolor (int): Optional fill color for the area outside the transform in the output image. (Pillow>=5.0.0)\n \"\"\"\n if not _is_pil_image(img):\n raise TypeError('img should be PIL Image. 
Got {}'.format(type(img)))\n\n assert isinstance(translate, (tuple, list)) and len(translate) == 2, \\\n \"Argument translate should be a list or tuple of length 2\"\n\n assert scale > 0.0, \"Argument scale should be positive\"\n\n output_size = img.size\n center = (img.size[0] * 0.5 + 0.5, img.size[1] * 0.5 + 0.5)\n matrix = _get_inverse_affine_matrix(center, angle, translate, scale, shear)\n kwargs = {\"fillcolor\": fillcolor} if PILLOW_VERSION[0] == '5' else {}\n return img.transform(output_size, Image.AFFINE, matrix, resample, **kwargs)"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\nconverting image to grayscale version of image.", "response": "def to_grayscale(img, num_output_channels=1):\n \"\"\"Convert image to grayscale version of image.\n\n Args:\n img (PIL Image): Image to be converted to grayscale.\n\n Returns:\n PIL Image: Grayscale version of the image.\n if num_output_channels = 1 : returned image is single channel\n\n if num_output_channels = 3 : returned image is 3 channel with r = g = b\n \"\"\"\n if not _is_pil_image(img):\n raise TypeError('img should be PIL Image. Got {}'.format(type(img)))\n\n if num_output_channels == 1:\n img = img.convert('L')\n elif num_output_channels == 3:\n img = img.convert('L')\n np_img = np.array(img, dtype=np.uint8)\n np_img = np.dstack([np_img, np_img, np_img])\n img = Image.fromarray(np_img, 'RGB')\n else:\n raise ValueError('num_output_channels should be either 1 or 3')\n\n return img"} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\nmake a grid of images.", "response": "def make_grid(tensor, nrow=8, padding=2,\n normalize=False, range=None, scale_each=False, pad_value=0):\n \"\"\"Make a grid of images.\n\n Args:\n tensor (Tensor or list): 4D mini-batch Tensor of shape (B x C x H x W)\n or a list of images all of the same size.\n nrow (int, optional): Number of images displayed in each row of the grid.\n The Final grid size is (B / nrow, nrow). 
Default is 8.\n padding (int, optional): amount of padding. Default is 2.\n normalize (bool, optional): If True, shift the image to the range (0, 1),\n by subtracting the minimum and dividing by the maximum pixel value.\n range (tuple, optional): tuple (min, max) where min and max are numbers,\n then these numbers are used to normalize the image. By default, min and max\n are computed from the tensor.\n scale_each (bool, optional): If True, scale each image in the batch of\n images separately rather than the (min, max) over all images.\n pad_value (float, optional): Value for the padded pixels.\n\n Example:\n See this notebook `here `_\n\n \"\"\"\n if not (torch.is_tensor(tensor) or\n (isinstance(tensor, list) and all(torch.is_tensor(t) for t in tensor))):\n raise TypeError('tensor or list of tensors expected, got {}'.format(type(tensor)))\n\n # if list of tensors, convert to a 4D mini-batch Tensor\n if isinstance(tensor, list):\n tensor = torch.stack(tensor, dim=0)\n\n if tensor.dim() == 2: # single image H x W\n tensor = tensor.unsqueeze(0)\n if tensor.dim() == 3: # single image\n if tensor.size(0) == 1: # if single-channel, convert to 3-channel\n tensor = torch.cat((tensor, tensor, tensor), 0)\n tensor = tensor.unsqueeze(0)\n\n if tensor.dim() == 4 and tensor.size(1) == 1: # single-channel images\n tensor = torch.cat((tensor, tensor, tensor), 1)\n\n if normalize is True:\n tensor = tensor.clone() # avoid modifying tensor in-place\n if range is not None:\n assert isinstance(range, tuple), \\\n \"range has to be a tuple (min, max) if specified. 
min and max are numbers\"\n\n def norm_ip(img, min, max):\n img.clamp_(min=min, max=max)\n img.add_(-min).div_(max - min + 1e-5)\n\n def norm_range(t, range):\n if range is not None:\n norm_ip(t, range[0], range[1])\n else:\n norm_ip(t, float(t.min()), float(t.max()))\n\n if scale_each is True:\n for t in tensor: # loop over mini-batch dimension\n norm_range(t, range)\n else:\n norm_range(tensor, range)\n\n if tensor.size(0) == 1:\n return tensor.squeeze()\n\n # make the mini-batch of images into a grid\n nmaps = tensor.size(0)\n xmaps = min(nrow, nmaps)\n ymaps = int(math.ceil(float(nmaps) / xmaps))\n height, width = int(tensor.size(2) + padding), int(tensor.size(3) + padding)\n grid = tensor.new_full((3, height * ymaps + padding, width * xmaps + padding), pad_value)\n k = 0\n for y in irange(ymaps):\n for x in irange(xmaps):\n if k >= nmaps:\n break\n grid.narrow(1, y * height + padding, height - padding)\\\n .narrow(2, x * width + padding, width - padding)\\\n .copy_(tensor[k])\n k = k + 1\n return grid"} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\nsave a given Tensor into an image file.", "response": "def save_image(tensor, filename, nrow=8, padding=2,\n normalize=False, range=None, scale_each=False, pad_value=0):\n \"\"\"Save a given Tensor into an image file.\n\n Args:\n tensor (Tensor or list): Image to be saved. 
If given a mini-batch tensor,\n saves the tensor as a grid of images by calling ``make_grid``.\n **kwargs: Other arguments are documented in ``make_grid``.\n \"\"\"\n from PIL import Image\n grid = make_grid(tensor, nrow=nrow, padding=padding, pad_value=pad_value,\n normalize=normalize, range=range, scale_each=scale_each)\n # Add 0.5 after unnormalizing to [0, 255] to round to nearest integer\n ndarr = grid.mul_(255).add_(0.5).clamp_(0, 255).permute(1, 2, 0).to('cpu', torch.uint8).numpy()\n im = Image.fromarray(ndarr)\n im.save(filename)"} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\nfinds the class folders in a dataset.", "response": "def _find_classes(self, dir):\n \"\"\"\n Finds the class folders in a dataset.\n\n Args:\n dir (string): Root directory path.\n\n Returns:\n tuple: (classes, class_to_idx) where classes are relative to (dir), and class_to_idx is a dictionary.\n\n Ensures:\n No class is a subdirectory of another.\n \"\"\"\n if sys.version_info >= (3, 5):\n # Faster and available in Python 3.5 and above\n classes = [d.name for d in os.scandir(dir) if d.is_dir()]\n else:\n classes = [d for d in os.listdir(dir) if os.path.isdir(os.path.join(dir, d))]\n classes.sort()\n class_to_idx = {classes[i]: i for i in range(len(classes))}\n return classes, class_to_idx"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef read_image_file(data_dir, image_ext, n):\n\n def PIL2array(_img):\n \"\"\"Convert PIL image type to numpy 2D array\n \"\"\"\n return np.array(_img.getdata(), dtype=np.uint8).reshape(64, 64)\n\n def find_files(_data_dir, _image_ext):\n \"\"\"Return a list with the file names of the images containing the patches\n \"\"\"\n files = []\n # find those files with the specified extension\n for file_dir in os.listdir(_data_dir):\n if file_dir.endswith(_image_ext):\n files.append(os.path.join(_data_dir, file_dir))\n return sorted(files) # sort files in ascend order to 
keep relations\n\n patches = []\n list_files = find_files(data_dir, image_ext)\n\n for fpath in list_files:\n img = Image.open(fpath)\n for y in range(0, 1024, 64):\n for x in range(0, 1024, 64):\n patch = img.crop((x, y, x + 64, y + 64))\n patches.append(PIL2array(patch))\n return torch.ByteTensor(np.array(patches[:n]))", "response": "Read the image file and return a Tensor containing the patches"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef read_info_file(data_dir, info_file):\n labels = []\n with open(os.path.join(data_dir, info_file), 'r') as f:\n labels = [int(line.split()[0]) for line in f]\n return torch.LongTensor(labels)", "response": "Read the info file and keep only the ID of the 3D point."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef read_matches_files(data_dir, matches_file):\n matches = []\n with open(os.path.join(data_dir, matches_file), 'r') as f:\n for line in f:\n line_split = line.split()\n matches.append([int(line_split[0]), int(line_split[3]),\n int(line_split[1] == line_split[4])])\n return torch.LongTensor(matches)", "response": "Read the matches file and keep only 3D point ID."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef accuracy(output, target, topk=(1,)):\n with torch.no_grad():\n maxk = max(topk)\n batch_size = target.size(0)\n\n _, pred = output.topk(maxk, 1, True, True)\n pred = pred.t()\n correct = pred.eq(target[None])\n\n res = []\n for k in topk:\n correct_k = correct[:k].flatten().sum(dtype=torch.float32)\n res.append(correct_k * (100.0 / batch_size))\n return res", "response": "Computes the accuracy over the k top predictions for the specified values of k"} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef setup_for_distributed(is_master):\n import builtins as __builtin__\n builtin_print = 
__builtin__.print\n\n def print(*args, **kwargs):\n force = kwargs.pop('force', False)\n if is_master or force:\n builtin_print(*args, **kwargs)\n\n __builtin__.print = print", "response": "This function disables printing when not in master process\n "} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef synchronize_between_processes(self):\n if not is_dist_avail_and_initialized():\n return\n t = torch.tensor([self.count, self.total], dtype=torch.float64, device='cuda')\n dist.barrier()\n dist.all_reduce(t)\n t = t.tolist()\n self.count = int(t[0])\n self.total = t[1]", "response": "Synchronize the deque with the distribution processes."} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\ncreates directories if they do not exist.", "response": "def makedir_exist_ok(dirpath):\n \"\"\"\n Python2 support for os.makedirs(.., exist_ok=True)\n \"\"\"\n try:\n os.makedirs(dirpath)\n except OSError as e:\n if e.errno == errno.EEXIST:\n pass\n else:\n raise"} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\ndownloading a file from a URL and place it in a directory.", "response": "def download_url(url, root, filename=None, md5=None):\n \"\"\"Download a file from a url and place it in root.\n\n Args:\n url (str): URL to download file from\n root (str): Directory to place downloaded file in\n filename (str, optional): Name to save the file under. If None, use the basename of the URL\n md5 (str, optional): MD5 checksum of the download. 
If None, do not check\n \"\"\"\n from six.moves import urllib\n\n root = os.path.expanduser(root)\n if not filename:\n filename = os.path.basename(url)\n fpath = os.path.join(root, filename)\n\n makedir_exist_ok(root)\n\n # downloads file\n if os.path.isfile(fpath) and check_integrity(fpath, md5):\n print('Using downloaded and verified file: ' + fpath)\n else:\n try:\n print('Downloading ' + url + ' to ' + fpath)\n urllib.request.urlretrieve(\n url, fpath,\n reporthook=gen_bar_updater()\n )\n except OSError:\n if url[:5] == 'https':\n url = url.replace('https:', 'http:')\n print('Failed download. Trying https -> http instead.'\n ' Downloading ' + url + ' to ' + fpath)\n urllib.request.urlretrieve(\n url, fpath,\n reporthook=gen_bar_updater()\n )"} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef list_dir(root, prefix=False):\n root = os.path.expanduser(root)\n directories = list(\n filter(\n lambda p: os.path.isdir(os.path.join(root, p)),\n os.listdir(root)\n )\n )\n\n if prefix is True:\n directories = [os.path.join(root, d) for d in directories]\n\n return directories", "response": "List all directories at a given root"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\nlisting all files ending with a suffix at a given root", "response": "def list_files(root, suffix, prefix=False):\n \"\"\"List all files ending with a suffix at a given root\n\n Args:\n root (str): Path to directory whose folders need to be listed\n suffix (str or tuple): Suffix of the files to match, e.g. 
'.png' or ('.jpg', '.png').\n It uses the Python \"str.endswith\" method and is passed directly\n prefix (bool, optional): If true, prepends the path to each result, otherwise\n only returns the name of the files found\n \"\"\"\n root = os.path.expanduser(root)\n files = list(\n filter(\n lambda p: os.path.isfile(os.path.join(root, p)) and p.endswith(suffix),\n os.listdir(root)\n )\n )\n\n if prefix is True:\n files = [os.path.join(root, d) for d in files]\n\n return files"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef download_file_from_google_drive(file_id, root, filename=None, md5=None):\n # Based on https://stackoverflow.com/questions/38511444/python-download-files-from-google-drive-using-url\n import requests\n url = \"https://docs.google.com/uc?export=download\"\n\n root = os.path.expanduser(root)\n if not filename:\n filename = file_id\n fpath = os.path.join(root, filename)\n\n makedir_exist_ok(root)\n\n if os.path.isfile(fpath) and check_integrity(fpath, md5):\n print('Using downloaded and verified file: ' + fpath)\n else:\n session = requests.Session()\n\n response = session.get(url, params={'id': file_id}, stream=True)\n token = _get_confirm_token(response)\n\n if token:\n params = {'id': file_id, 'confirm': token}\n response = session.get(url, params=params, stream=True)\n\n _save_response_content(response, fpath)", "response": "Download a Google Drive file from Google Drive and place it in root."} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\ngets parameters for a random crop.", "response": "def get_params(img, output_size):\n \"\"\"Get parameters for ``crop`` for a random crop.\n\n Args:\n img (PIL Image): Image to be cropped.\n output_size (tuple): Expected output size of the crop.\n\n Returns:\n tuple: params (i, j, h, w) to be passed to ``crop`` for random crop.\n \"\"\"\n w, h = img.size\n th, tw = output_size\n if w == tw and h == th:\n return 0, 0, h, 
w\n\n i = random.randint(0, h - th)\n j = random.randint(0, w - tw)\n return i, j, th, tw"} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef get_params(width, height, distortion_scale):\n half_height = int(height / 2)\n half_width = int(width / 2)\n topleft = (random.randint(0, int(distortion_scale * half_width)),\n random.randint(0, int(distortion_scale * half_height)))\n topright = (random.randint(width - int(distortion_scale * half_width) - 1, width - 1),\n random.randint(0, int(distortion_scale * half_height)))\n botright = (random.randint(width - int(distortion_scale * half_width) - 1, width - 1),\n random.randint(height - int(distortion_scale * half_height) - 1, height - 1))\n botleft = (random.randint(0, int(distortion_scale * half_width)),\n random.randint(height - int(distortion_scale * half_height) - 1, height - 1))\n startpoints = [(0, 0), (width - 1, 0), (width - 1, height - 1), (0, height - 1)]\n endpoints = [topleft, topright, botright, botleft]\n return startpoints, endpoints", "response": "Get parameters for a random perspective transform."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef get_params(img, scale, ratio):\n area = img.size[0] * img.size[1]\n\n for attempt in range(10):\n target_area = random.uniform(*scale) * area\n log_ratio = (math.log(ratio[0]), math.log(ratio[1]))\n aspect_ratio = math.exp(random.uniform(*log_ratio))\n\n w = int(round(math.sqrt(target_area * aspect_ratio)))\n h = int(round(math.sqrt(target_area / aspect_ratio)))\n\n if w <= img.size[0] and h <= img.size[1]:\n i = random.randint(0, img.size[1] - h)\n j = random.randint(0, img.size[0] - w)\n return i, j, h, w\n\n # Fallback to central crop\n in_ratio = img.size[0] / img.size[1]\n if (in_ratio < min(ratio)):\n w = img.size[0]\n h = w / min(ratio)\n elif (in_ratio > max(ratio)):\n h = img.size[1]\n w = h * max(ratio)\n else: # whole image\n w = 
img.size[0]\n h = img.size[1]\n i = (img.size[1] - h) // 2\n j = (img.size[0] - w) // 2\n return i, j, h, w", "response": "Get parameters for a random sized crop."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef get_params(brightness, contrast, saturation, hue):\n transforms = []\n\n if brightness is not None:\n brightness_factor = random.uniform(brightness[0], brightness[1])\n transforms.append(Lambda(lambda img: F.adjust_brightness(img, brightness_factor)))\n\n if contrast is not None:\n contrast_factor = random.uniform(contrast[0], contrast[1])\n transforms.append(Lambda(lambda img: F.adjust_contrast(img, contrast_factor)))\n\n if saturation is not None:\n saturation_factor = random.uniform(saturation[0], saturation[1])\n transforms.append(Lambda(lambda img: F.adjust_saturation(img, saturation_factor)))\n\n if hue is not None:\n hue_factor = random.uniform(hue[0], hue[1])\n transforms.append(Lambda(lambda img: F.adjust_hue(img, hue_factor)))\n\n random.shuffle(transforms)\n transform = Compose(transforms)\n\n return transform", "response": "Get a randomized transform to be applied on image."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\nget parameters for affine transformation", "response": "def get_params(degrees, translate, scale_ranges, shears, img_size):\n \"\"\"Get parameters for affine transformation\n\n Returns:\n sequence: params to be passed to the affine transformation\n \"\"\"\n angle = random.uniform(degrees[0], degrees[1])\n if translate is not None:\n max_dx = translate[0] * img_size[0]\n max_dy = translate[1] * img_size[1]\n translations = (np.round(random.uniform(-max_dx, max_dx)),\n np.round(random.uniform(-max_dy, max_dy)))\n else:\n translations = (0, 0)\n\n if scale_ranges is not None:\n scale = random.uniform(scale_ranges[0], scale_ranges[1])\n else:\n scale = 1.0\n\n if shears is not None:\n shear = random.uniform(shears[0], shears[1])\n 
else:\n shear = 0.0\n\n return angle, translations, scale, shear"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef inception_v3(pretrained=False, **kwargs):\n if pretrained:\n if 'transform_input' not in kwargs:\n kwargs['transform_input'] = True\n if 'aux_logits' in kwargs:\n original_aux_logits = kwargs['aux_logits']\n kwargs['aux_logits'] = True\n else:\n original_aux_logits = True\n model = Inception3(**kwargs)\n model.load_state_dict(model_zoo.load_url(model_urls['inception_v3_google']))\n if not original_aux_logits:\n model.aux_logits = False\n del model.AuxLogits\n return model\n\n return Inception3(**kwargs)", "response": "r A function to create an inception v3 model."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef download(self):\n import tarfile\n\n if self._check_integrity():\n print('Files already downloaded and verified')\n return\n\n download_url(self.url, self.root, self.filename, self.md5_checksum)\n\n # Extract file\n with tarfile.open(os.path.join(self.root, self.filename), 'r:gz') as tar:\n tar.extractall(path=self.root)\n\n # Download individual photos\n with open(os.path.join(self.root, 'dataset', 'SBU_captioned_photo_dataset_urls.txt')) as fh:\n for line in fh:\n url = line.rstrip()\n try:\n download_url(url, os.path.join(self.root, 'dataset'))\n except OSError:\n # The images point to public images on Flickr.\n # Note: Images might be removed by users at anytime.\n pass", "response": "Download and extract the tarball and download each individual photo."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef googlenet(pretrained=False, **kwargs):\n if pretrained:\n if 'transform_input' not in kwargs:\n kwargs['transform_input'] = True\n if 'aux_logits' not in kwargs:\n kwargs['aux_logits'] = False\n if kwargs['aux_logits']:\n warnings.warn('auxiliary 
heads in the pretrained googlenet model are NOT pretrained, '\n 'so make sure to train them')\n original_aux_logits = kwargs['aux_logits']\n kwargs['aux_logits'] = True\n kwargs['init_weights'] = False\n model = GoogLeNet(**kwargs)\n model.load_state_dict(model_zoo.load_url(model_urls['googlenet']))\n if not original_aux_logits:\n model.aux_logits = False\n del model.aux1, model.aux2\n return model\n\n return GoogLeNet(**kwargs)", "response": "r A function to create a new image net architectural version of the googlenet model."} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef download(self):\n\n if self._check_exists():\n return\n\n makedir_exist_ok(self.raw_folder)\n makedir_exist_ok(self.processed_folder)\n\n # download files\n for url in self.urls:\n filename = url.rpartition('/')[2]\n file_path = os.path.join(self.raw_folder, filename)\n download_url(url, root=self.raw_folder, filename=filename, md5=None)\n self.extract_gzip(gzip_path=file_path, remove_finished=True)\n\n # process and save as torch files\n print('Processing...')\n\n training_set = (\n read_image_file(os.path.join(self.raw_folder, 'train-images-idx3-ubyte')),\n read_label_file(os.path.join(self.raw_folder, 'train-labels-idx1-ubyte'))\n )\n test_set = (\n read_image_file(os.path.join(self.raw_folder, 't10k-images-idx3-ubyte')),\n read_label_file(os.path.join(self.raw_folder, 't10k-labels-idx1-ubyte'))\n )\n with open(os.path.join(self.processed_folder, self.training_file), 'wb') as f:\n torch.save(training_set, f)\n with open(os.path.join(self.processed_folder, self.test_file), 'wb') as f:\n torch.save(test_set, f)\n\n print('Done!')", "response": "Download the MNIST data if it doesn t exist in processed_folder already."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\ndownloading the EMNIST data from the given URL and save it as torch files.", "response": "def download(self):\n \"\"\"Download the EMNIST data if it doesn't 
exist in processed_folder already.\"\"\"\n import shutil\n import zipfile\n\n if self._check_exists():\n return\n\n makedir_exist_ok(self.raw_folder)\n makedir_exist_ok(self.processed_folder)\n\n # download files\n filename = self.url.rpartition('/')[2]\n file_path = os.path.join(self.raw_folder, filename)\n download_url(self.url, root=self.raw_folder, filename=filename, md5=None)\n\n print('Extracting zip archive')\n with zipfile.ZipFile(file_path) as zip_f:\n zip_f.extractall(self.raw_folder)\n os.unlink(file_path)\n gzip_folder = os.path.join(self.raw_folder, 'gzip')\n for gzip_file in os.listdir(gzip_folder):\n if gzip_file.endswith('.gz'):\n self.extract_gzip(gzip_path=os.path.join(gzip_folder, gzip_file))\n\n # process and save as torch files\n for split in self.splits:\n print('Processing ' + split)\n training_set = (\n read_image_file(os.path.join(gzip_folder, 'emnist-{}-train-images-idx3-ubyte'.format(split))),\n read_label_file(os.path.join(gzip_folder, 'emnist-{}-train-labels-idx1-ubyte'.format(split)))\n )\n test_set = (\n read_image_file(os.path.join(gzip_folder, 'emnist-{}-test-images-idx3-ubyte'.format(split))),\n read_label_file(os.path.join(gzip_folder, 'emnist-{}-test-labels-idx1-ubyte'.format(split)))\n )\n with open(os.path.join(self.processed_folder, self._training_file(split)), 'wb') as f:\n torch.save(training_set, f)\n with open(os.path.join(self.processed_folder, self._test_file(split)), 'wb') as f:\n torch.save(test_set, f)\n shutil.rmtree(gzip_folder)\n\n print('Done!')"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef get_current_theme_name(override=None):\n\n if override and (override in themes or override == '__common__'):\n return override\n theme_name = request.args.get('theme', request.preferences.get_value('theme'))\n if theme_name not in themes:\n theme_name = default_theme\n return theme_name", "response": "Returns theme name.\n\n Checks in this order:\n 1. override\n 2. 
cookies\n 3. settings"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef index():\n\n # output_format\n output_format = request.form.get('format', 'html')\n if output_format not in ['html', 'csv', 'json', 'rss']:\n output_format = 'html'\n\n # check if there is query\n if request.form.get('q') is None:\n if output_format == 'html':\n return render(\n 'index.html',\n )\n else:\n return index_error(output_format, 'No query'), 400\n\n # search\n search_query = None\n result_container = None\n try:\n search_query = get_search_query_from_webapp(request.preferences, request.form)\n # search = Search(search_query) # without plugins\n search = SearchWithPlugins(search_query, request.user_plugins, request)\n result_container = search.search()\n except Exception as e:\n # log exception\n logger.exception('search error')\n\n # is it an invalid input parameter or something else ?\n if (issubclass(e.__class__, SearxParameterException)):\n return index_error(output_format, e.message), 400\n else:\n return index_error(output_format, gettext('search error')), 500\n\n # results\n results = result_container.get_ordered_results()\n number_of_results = result_container.results_number()\n if number_of_results < result_container.results_length():\n number_of_results = 0\n\n # UI\n advanced_search = request.form.get('advanced_search', None)\n\n # output\n for result in results:\n if output_format == 'html':\n if 'content' in result and result['content']:\n result['content'] = highlight_content(escape(result['content'][:1024]), search_query.query)\n result['title'] = highlight_content(escape(result['title'] or u''), search_query.query)\n else:\n if result.get('content'):\n result['content'] = html_to_text(result['content']).strip()\n # removing html content and whitespace duplications\n result['title'] = ' '.join(html_to_text(result['title']).strip().split())\n\n result['pretty_url'] = prettify_url(result['url'])\n\n # TODO, check if timezone 
is calculated right\n if 'publishedDate' in result:\n try: # test if publishedDate >= 1900 (datetime module bug)\n result['pubdate'] = result['publishedDate'].strftime('%Y-%m-%d %H:%M:%S%z')\n except ValueError:\n result['publishedDate'] = None\n else:\n if result['publishedDate'].replace(tzinfo=None) >= datetime.now() - timedelta(days=1):\n timedifference = datetime.now() - result['publishedDate'].replace(tzinfo=None)\n minutes = int((timedifference.seconds / 60) % 60)\n hours = int(timedifference.seconds / 60 / 60)\n if hours == 0:\n result['publishedDate'] = gettext(u'{minutes} minute(s) ago').format(minutes=minutes)\n else:\n result['publishedDate'] = gettext(u'{hours} hour(s), {minutes} minute(s) ago').format(hours=hours, minutes=minutes) # noqa\n else:\n result['publishedDate'] = format_date(result['publishedDate'])\n\n if output_format == 'json':\n return Response(json.dumps({'query': search_query.query.decode('utf-8'),\n 'number_of_results': number_of_results,\n 'results': results,\n 'answers': list(result_container.answers),\n 'corrections': list(result_container.corrections),\n 'infoboxes': result_container.infoboxes,\n 'suggestions': list(result_container.suggestions),\n 'unresponsive_engines': list(result_container.unresponsive_engines)},\n default=lambda item: list(item) if isinstance(item, set) else item),\n mimetype='application/json')\n elif output_format == 'csv':\n csv = UnicodeWriter(StringIO())\n keys = ('title', 'url', 'content', 'host', 'engine', 'score')\n csv.writerow(keys)\n for row in results:\n row['host'] = row['parsed_url'].netloc\n csv.writerow([row.get(key, '') for key in keys])\n csv.stream.seek(0)\n response = Response(csv.stream.read(), mimetype='application/csv')\n cont_disp = 'attachment;Filename=searx_-_{0}.csv'.format(search_query.query)\n response.headers.add('Content-Disposition', cont_disp)\n return response\n elif output_format == 'rss':\n response_rss = render(\n 'opensearch_response_rss.xml',\n results=results,\n 
q=request.form['q'],\n number_of_results=number_of_results,\n base_url=get_base_url(),\n override_theme='__common__',\n )\n return Response(response_rss, mimetype='text/xml')\n\n return render(\n 'results.html',\n results=results,\n q=request.form['q'],\n selected_categories=search_query.categories,\n pageno=search_query.pageno,\n time_range=search_query.time_range,\n number_of_results=format_decimal(number_of_results),\n advanced_search=advanced_search,\n suggestions=result_container.suggestions,\n answers=result_container.answers,\n corrections=result_container.corrections,\n infoboxes=result_container.infoboxes,\n paging=result_container.paging,\n unresponsive_engines=result_container.unresponsive_engines,\n current_language=match_language(search_query.lang,\n LANGUAGE_CODES,\n fallback=settings['search']['language']),\n base_url=get_base_url(),\n theme=get_current_theme_name(),\n favicons=global_favicons[themes.index(get_current_theme_name())]\n )", "response": "Render index page.\n\n Supported outputs: html, json, csv, rss."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef preferences():\n\n # save preferences\n if request.method == 'POST':\n resp = make_response(redirect(urljoin(settings['server']['base_url'], url_for('index'))))\n try:\n request.preferences.parse_form(request.form)\n except ValidationException:\n request.errors.append(gettext('Invalid settings, please edit your preferences'))\n return resp\n return request.preferences.save(resp)\n\n # render preferences\n image_proxy = request.preferences.get_value('image_proxy')\n lang = request.preferences.get_value('language')\n disabled_engines = request.preferences.engines.get_disabled()\n allowed_plugins = request.preferences.plugins.get_enabled()\n\n # stats for preferences page\n stats = {}\n\n for c in categories:\n for e in categories[c]:\n stats[e.name] = {'time': None,\n 'warn_timeout': False,\n 'warn_time': False}\n if e.timeout > 
settings['outgoing']['request_timeout']:\n stats[e.name]['warn_timeout'] = True\n stats[e.name]['supports_selected_language'] = _is_selected_language_supported(e, request.preferences)\n\n # get first element [0], the engine time,\n # and then the second element [1] : the time (the first one is the label)\n for engine_stat in get_engines_stats()[0][1]:\n stats[engine_stat.get('name')]['time'] = round(engine_stat.get('avg'), 3)\n if engine_stat.get('avg') > settings['outgoing']['request_timeout']:\n stats[engine_stat.get('name')]['warn_time'] = True\n # end of stats\n\n return render('preferences.html',\n locales=settings['locales'],\n current_locale=get_locale(),\n image_proxy=image_proxy,\n engines_by_category=categories,\n stats=stats,\n answerers=[{'info': a.self_info(), 'keywords': a.keywords} for a in answerers],\n disabled_engines=disabled_engines,\n autocomplete_backends=autocomplete_backends,\n shortcuts={y: x for x, y in engine_shortcuts.items()},\n themes=themes,\n plugins=plugins,\n doi_resolvers=settings['doi_resolvers'],\n current_doi_resolver=get_doi_resolver(request.args, request.preferences.get_value('doi_resolver')),\n allowed_plugins=allowed_plugins,\n theme=get_current_theme_name(),\n preferences_url_params=request.preferences.get_as_url_params(),\n base_url=get_base_url(),\n preferences=True)", "response": "Render preferences page and save user preferences"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef request(query, params):\n '''pre-request callback\n params:\n method : POST/GET\n headers : {}\n data : {} # if method == POST\n url : ''\n category: 'search category'\n pageno : 1 # number of the requested page\n '''\n\n offset = (params['pageno'] - 1)\n params['url'] = search_url.format(offset=offset, query=quote(query))\n return params", "response": "pre-request callback\n params:\n method : POST/GET\n headers : {}\n data : {} # if method == POST\n url : ''\n category: 'search category'\n pageno 
: 1 # number of the requested page"} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\nacts as a post-response callback (resp: requests response object)", "response": "def response(resp):\n '''post-response callback\n resp: requests response object\n '''\n results = []\n\n dom = html.fromstring(resp.text)\n\n try:\n number_of_results_string = re.sub('[^0-9]', '', dom.xpath(\n '//a[@class=\"active\" and contains(@href,\"/suchen/dudenonline\")]/span/text()')[0]\n )\n\n results.append({'number_of_results': int(number_of_results_string)})\n\n except:\n logger.debug(\"Couldn't read number of results.\")\n pass\n\n for result in dom.xpath('//section[@class=\"wide\" and not(contains(@style,\"overflow:hidden\"))]'):\n try:\n logger.debug(\"running for %s\" % str(result))\n link = result.xpath('.//h2/a')[0]\n url = link.attrib.get('href')\n title = result.xpath('string(.//h2/a)')\n content = extract_text(result.xpath('.//p'))\n # append result\n results.append({'url': url,\n 'title': title,\n 'content': content})\n except:\n logger.debug('result parse error in:\\n%s', etree.tostring(result, pretty_print=True))\n continue\n\n return results"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef get_themes(templates_path):\n themes = os.listdir(templates_path)\n if '__common__' in themes:\n themes.remove('__common__')\n return themes", "response": "Returns available themes list."} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef searx_bang(full_query):\n '''check if the searchQuery contain a bang, and create fitting autocompleter results'''\n # check if there is a query which can be parsed\n if len(full_query.getSearchQuery()) == 0:\n return []\n\n results = []\n\n # check if current query stats with !bang\n first_char = full_query.getSearchQuery()[0]\n if first_char == '!' 
or first_char == '?':\n if len(full_query.getSearchQuery()) == 1:\n # show some example queries\n # TODO, check if engine is not avaliable\n results.append(first_char + \"images\")\n results.append(first_char + \"wikipedia\")\n results.append(first_char + \"osm\")\n else:\n engine_query = full_query.getSearchQuery()[1:]\n\n # check if query starts with categorie name\n for categorie in categories:\n if categorie.startswith(engine_query):\n results.append(first_char + '{categorie}'.format(categorie=categorie))\n\n # check if query starts with engine name\n for engine in engines:\n if engine.startswith(engine_query.replace('_', ' ')):\n results.append(first_char + '{engine}'.format(engine=engine.replace(' ', '_')))\n\n # check if query starts with engine shortcut\n for engine_shortcut in engine_shortcuts:\n if engine_shortcut.startswith(engine_query):\n results.append(first_char + '{engine_shortcut}'.format(engine_shortcut=engine_shortcut))\n\n # check if current query stats with :bang\n elif first_char == ':':\n if len(full_query.getSearchQuery()) == 1:\n # show some example queries\n results.append(\":en\")\n results.append(\":en_us\")\n results.append(\":english\")\n results.append(\":united_kingdom\")\n else:\n engine_query = full_query.getSearchQuery()[1:]\n\n for lc in language_codes:\n lang_id, lang_name, country, english_name = map(unicode.lower, lc)\n\n # check if query starts with language-id\n if lang_id.startswith(engine_query):\n if len(engine_query) <= 2:\n results.append(u':{lang_id}'.format(lang_id=lang_id.split('-')[0]))\n else:\n results.append(u':{lang_id}'.format(lang_id=lang_id))\n\n # check if query starts with language name\n if lang_name.startswith(engine_query) or english_name.startswith(engine_query):\n results.append(u':{lang_name}'.format(lang_name=lang_name))\n\n # check if query starts with country\n if country.startswith(engine_query.replace('_', ' ')):\n results.append(u':{country}'.format(country=country.replace(' ', '_')))\n\n # 
remove duplicates\n result_set = set(results)\n\n # remove results which are already contained in the query\n for query_part in full_query.query_parts:\n if query_part in result_set:\n result_set.remove(query_part)\n\n # convert result_set back to list\n return list(result_set)", "response": "check if the searchQuery contains a bang and create fitting autocompleter results"} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\nparsing response and return list of dictionaries", "response": "def response(resp):\n \"\"\"remove first and last lines to get only json\"\"\"\n json_resp = resp.text[resp.text.find('\\n') + 1:resp.text.rfind('\\n') - 2]\n results = []\n try:\n conversion_rate = float(json.loads(json_resp)['conversion']['converted-amount'])\n except:\n return results\n answer = '{0} {1} = {2} {3}, 1 {1} ({5}) = {4} {3} ({6})'.format(\n resp.search_params['amount'],\n resp.search_params['from'],\n resp.search_params['amount'] * conversion_rate,\n resp.search_params['to'],\n conversion_rate,\n resp.search_params['from_name'],\n resp.search_params['to_name'],\n )\n\n url = 'https://duckduckgo.com/js/spice/currency/1/{0}/{1}'.format(\n resp.search_params['from'].upper(), resp.search_params['to'])\n\n results.append({'answer': answer, 'url': url})\n\n return results"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef custom_gradient(fx, gx, x, fx_gx_manually_stopped=False, name=None):\n def maybe_stop(x):\n if fx_gx_manually_stopped:\n return x\n return tf.stop_gradient(x)\n\n with tf.compat.v1.name_scope(name, 'custom_gradient', [fx, gx, x]):\n fx = tf.convert_to_tensor(value=fx, name='fx')\n # We don't want to bother eagerly computing `gx` since we may not even need\n # it.\n with tf.control_dependencies([fx]):\n if is_list_like(x):\n x = [identity(x_, name='x') for x_ in x]\n else:\n x = [identity(x, name='x')]\n\n if is_list_like(gx):\n gx = [identity(gx_, dtype=fx.dtype, 
name='gx')\n for gx_ in gx]\n else:\n gx = [identity(gx, dtype=fx.dtype, name='gx')]\n\n override_grad = []\n for x_, gx_ in zip(x, gx):\n # Observe: tf.gradients(f(x), x)[i].shape == x[i].shape\n # thus we check that the user is supplying correct shapes.\n equal_shape = tf.compat.v1.assert_equal(\n tf.shape(input=x_),\n tf.shape(input=gx_),\n message='Each `x` must have the same shape as each `gx`.')\n with tf.control_dependencies([equal_shape]):\n # IEEE754 ensures `(x-x)==0.` and that `0.*x==0.` so we make sure to\n # write the code this way, rather than, e.g.,\n # `sum_x * stop(gx) + stop(fx - sum_x * gx)`.\n # For more discussion regarding the relevant portions of the IEEE754\n # standard, see the StackOverflow question,\n # \"Is there a floating point value of x, for which x-x == 0 is false?\"\n # http://stackoverflow.com/q/2686644\n zeros_like_x_ = x_ - tf.stop_gradient(x_)\n override_grad.append(\n tf.reduce_sum(input_tensor=maybe_stop(gx_) * zeros_like_x_))\n override_grad = sum(override_grad)\n override_grad /= tf.cast(tf.size(input=fx), dtype=fx.dtype.base_dtype)\n\n # Proof of correctness:\n #\n # f(x) = x * stop[gx] + stop[fx - x * gx]\n # = stop[fx]\n #\n # g(x) = grad[fx]\n # = stop[gx] + grad[stop[fx - x * gx]]\n # = stop[gx] + 0\n #\n # Notice that when x is zero it still works:\n # grad[x * stop(gx) + stop(fx - x * gx)] = 1 * stop[gx] + 0 = stop[gx]\n #\n # The proof is similar for the tensor-domain case, except that we\n # `reduce_sum` the `stop[gx] * (x - stop[x])` then rescale by\n # `tf.size(fx)` since this reduced version is broadcast to `fx`.\n return maybe_stop(fx) + override_grad", "response": "Embeds a custom gradient into a tensor."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\ncompute the value of f wrt each of xs.", "response": "def value_and_gradient(f, xs, use_gradient_tape=False, name=None):\n \"\"\"Computes `f(*xs)` and its gradients wrt to `*xs`.\n\n Args:\n f: Python `callable` to be 
differentiated. If `f` returns a scalar, this\n scalar will be differentiated. If `f` returns a tensor or list of tensors,\n by default a scalar will be computed by adding all their values to produce\n a single scalar. If desired, the tensors can be elementwise multiplied by\n the tensors passed as the `dy` keyword argument to the returned gradient\n function.\n xs: Python list of parameters of f for which to differentiate. (Can also\n be single `Tensor`.)\n use_gradient_tape: Python `bool` indicating that `tf.GradientTape`\n should be used regardless of `tf.executing_eagerly()` status.\n Default value: `False`.\n name: Python `str` name prefixed to ops created by this function.\n Default value: `None` (i.e., `'value_and_gradient'`).\n\n Returns:\n y: `y = f(*xs)`.\n dydx: Gradient of `y` wrt each of `xs`.\n \"\"\"\n with tf.compat.v1.name_scope(name, 'value_and_gradient', [xs]):\n is_xs_list_like = isinstance(xs, (tuple, list))\n if not is_xs_list_like:\n xs = [xs]\n xs = [tf.convert_to_tensor(value=x, name='x{}'.format(i))\n for i, x in enumerate(xs)]\n if tf.executing_eagerly() or use_gradient_tape:\n with tf.GradientTape(watch_accessed_variables=False) as tape:\n for x in xs:\n tape.watch(x)\n y = f(*xs)\n dydx = tape.gradient(y, xs)\n else:\n y = f(*xs)\n dydx = tf.gradients(ys=y, xs=xs)\n if not is_xs_list_like:\n dydx = dydx[0]\n return y, dydx"} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef eight_schools_joint_log_prob(\n treatment_effects, treatment_stddevs,\n avg_effect, avg_stddev, school_effects_standard):\n \"\"\"Eight-schools joint log-prob.\"\"\"\n rv_avg_effect = tfd.Normal(loc=0., scale=10.)\n rv_avg_stddev = tfd.Normal(loc=5., scale=1.)\n rv_school_effects_standard = mvn(\n loc=tf.zeros_like(school_effects_standard),\n scale=tf.ones_like(school_effects_standard))\n rv_treatment_effects = mvn(\n loc=(avg_effect + tf.exp(avg_stddev) * school_effects_standard),\n 
scale=treatment_stddevs)\n return (\n rv_avg_effect.log_prob(avg_effect) +\n rv_avg_stddev.log_prob(avg_stddev) +\n rv_school_effects_standard.log_prob(school_effects_standard) +\n rv_treatment_effects.log_prob(treatment_effects))", "response": "Eight-schools joint log-prob."} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef benchmark_eight_schools_hmc(\n num_results=int(5e3),\n num_burnin_steps=int(3e3),\n num_leapfrog_steps=3,\n step_size=0.4):\n \"\"\"Runs HMC on the eight-schools unnormalized posterior.\"\"\"\n\n num_schools = 8\n treatment_effects = tf.constant(\n [28, 8, -3, 7, -1, 1, 18, 12],\n dtype=np.float32,\n name='treatment_effects')\n treatment_stddevs = tf.constant(\n [15, 10, 16, 11, 9, 11, 10, 18],\n dtype=np.float32,\n name='treatment_stddevs')\n\n def unnormalized_posterior_log_prob(\n avg_effect, avg_stddev, school_effects_standard):\n \"\"\"Eight-schools unnormalized log posterior.\"\"\"\n return eight_schools_joint_log_prob(\n treatment_effects, treatment_stddevs,\n avg_effect, avg_stddev, school_effects_standard)\n\n if tf.executing_eagerly():\n sample_chain = tf.function(tfp.mcmc.sample_chain)\n else:\n sample_chain = tfp.mcmc.sample_chain\n\n def computation():\n \"\"\"The benchmark computation.\"\"\"\n _, kernel_results = sample_chain(\n num_results=num_results,\n num_burnin_steps=num_burnin_steps,\n current_state=(\n tf.zeros([], name='init_avg_effect'),\n tf.zeros([], name='init_avg_stddev'),\n tf.ones([num_schools], name='init_school_effects_standard'),\n ),\n kernel=tfp.mcmc.HamiltonianMonteCarlo(\n target_log_prob_fn=unnormalized_posterior_log_prob,\n step_size=step_size,\n num_leapfrog_steps=num_leapfrog_steps))\n\n return kernel_results.is_accepted\n\n # Let's force evaluation of graph to ensure build time is not part of our time\n # trial.\n is_accepted_tensor = computation()\n if not tf.executing_eagerly():\n session = tf.compat.v1.Session()\n session.run(is_accepted_tensor)\n\n start_time = time.time()\n if tf.executing_eagerly():\n is_accepted = computation()\n else:\n is_accepted = session.run(is_accepted_tensor)\n wall_time = time.time() - start_time\n\n num_accepted = np.sum(is_accepted)\n acceptance_rate = np.float32(num_accepted) / np.float32(num_results)\n\n return dict(\n iters=(num_results + num_burnin_steps) * num_leapfrog_steps,\n extras={'acceptance_rate': acceptance_rate},\n wall_time=wall_time)", "response": "Benchmarks the eight-schools unnormalized posterior."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef expand_docstring(**kwargs):\n def _fn_wrapped(fn):\n \"\"\"Original function with modified `__doc__` attribute.\"\"\"\n doc = inspect.cleandoc(fn.__doc__)\n for k, v in six.iteritems(kwargs):\n # Capture each ${k} reference to replace with v.\n # We wrap the replacement in a function so no backslash escapes\n # are processed.\n pattern = r'\\$\\{' + str(k) + r'\\}'\n doc = re.sub(pattern, lambda match: v, doc) # pylint: disable=cell-var-from-loop\n fn.__doc__ = doc\n return fn\n return _fn_wrapped", "response": "Decorator to programmatically expand the docstring."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\ninferring the original name passed into a distribution constructor.", "response": "def _simple_name(distribution):\n \"\"\"Infer the original name passed into a distribution constructor.\n\n Distributions typically follow the pattern of\n with.name_scope(name) as name:\n super(name=name)\n so we attempt to reverse the name-scope transformation to allow\n addressing of RVs by the distribution's original, user-visible\n name kwarg.\n\n Args:\n distribution: a tfd.Distribution instance.\n Returns:\n simple_name: the original name passed into the Distribution.\n\n #### Example\n\n ```\n d1 = tfd.Normal(0., 1., name='x') # d1.name = 'x/'\n d2 = tfd.Normal(0., 1., name='x') # d2.name = 'x_2/'\n 
_simple_name(d2) # returns 'x'\n\n ```\n\n \"\"\"\n simple_name = distribution.name\n\n # turn 'scope/x/' into 'x'\n if simple_name.endswith('/'):\n simple_name = simple_name.split('/')[-2]\n\n # turn 'x_3' into 'x'\n parts = simple_name.split('_')\n if parts[-1].isdigit():\n simple_name = '_'.join(parts[:-1])\n\n return simple_name"} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef _build_custom_rv(distribution, sample_shape, value, name):\n # Program transformations (e.g., `make_log_joint_fn`) assume that\n # the traced constructor has `name` and `value` kwargs, enabling\n # them to override the value of an RV according to its name.\n # User-defined RVs inherit their name from the provided\n # distribution; this helper method exposes the name as a dummy kwarg\n # so that it's visible to program transformations.\n del name # unused\n return RandomVariable(distribution=distribution,\n sample_shape=sample_shape,\n value=value)", "response": "Helper method to build a custom RandomVariable instance."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\nwrap an existing distribution as a traceable random variable.", "response": "def as_random_variable(distribution,\n sample_shape=(),\n value=None):\n \"\"\"Wrap an existing distribution as a traceable random variable.\n\n This enables the use of custom or user-provided distributions in\n Edward models. Unlike a bare `RandomVariable` object, this method\n wraps the constructor so it is included in the Edward trace and its\n values can be properly intercepted and overridden.\n\n Where possible, you should prefer the built-in constructors\n (`ed.Normal`, etc); these simultaneously construct a Distribution\n and a RandomVariable object so that the distribution parameters\n themselves may be intercepted and overridden. 
RVs constructed via\n `as_random_variable()` have a fixed distribution and may not support\n program transformations (e.g, conjugate marginalization) that rely\n on overriding distribution parameters.\n\n Args:\n distribution: tfd.Distribution governing the distribution of the random\n variable, such as sampling and log-probabilities.\n sample_shape: tf.TensorShape of samples to draw from the random variable.\n Default is `()` corresponding to a single sample.\n value: Fixed tf.Tensor to associate with random variable. Must have shape\n `sample_shape + distribution.batch_shape + distribution.event_shape`.\n Default is to sample from random variable according to `sample_shape`.\n\n Returns:\n rv: a `RandomVariable` wrapping the provided distribution.\n\n #### Example\n\n ```python\n from tensorflow_probability import distributions as tfd\n from tensorflow_probability import edward2 as ed\n\n def model():\n # equivalent to ed.Normal(0., 1., name='x')\n return ed.as_random_variable(tfd.Normal(0., 1., name='x'))\n\n log_joint = ed.make_log_joint_fn(model)\n output = log_joint(x=2.)\n ```\n \"\"\"\n\n return _build_custom_rv(distribution=distribution,\n sample_shape=sample_shape,\n value=value,\n name=_simple_name(distribution))"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _make_random_variable(distribution_cls):\n\n @interceptable\n @functools.wraps(distribution_cls, assigned=('__module__', '__name__'))\n @docstring_util.expand_docstring(\n cls=distribution_cls.__name__,\n doc=inspect.cleandoc(distribution_cls.__init__.__doc__ or ''))\n def func(*args, **kwargs):\n # pylint: disable=g-doc-args\n \"\"\"Create a random variable for ${cls}.\n\n See ${cls} for more details.\n\n Returns:\n RandomVariable.\n\n #### Original Docstring for Distribution\n\n ${doc}\n \"\"\"\n # pylint: enable=g-doc-args\n sample_shape = kwargs.pop('sample_shape', ())\n value = kwargs.pop('value', None)\n return 
RandomVariable(distribution=distribution_cls(*args, **kwargs),\n sample_shape=sample_shape,\n value=value)\n return func", "response": "Factory function to make random variable given distribution class."} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\nreturns the shape for the mode and mean Tensors.", "response": "def _mode_mean_shape(self):\n \"\"\"Shape for the mode/mean Tensors.\"\"\"\n shape = tensorshape_util.concatenate(self.batch_shape, self.event_shape)\n has_static_shape = tensorshape_util.is_fully_defined(shape)\n if not has_static_shape:\n shape = tf.concat([\n self.batch_shape_tensor(),\n self.event_shape_tensor(),\n ], 0)\n return shape"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef one_step_predictive(model, observed_time_series, parameter_samples):\n\n with tf.compat.v1.name_scope(\n 'one_step_predictive', values=[observed_time_series, parameter_samples]):\n\n [\n observed_time_series,\n is_missing\n ] = sts_util.canonicalize_observed_time_series_with_mask(\n observed_time_series)\n\n # Run filtering over the training timesteps to extract the\n # predictive means and variances.\n num_timesteps = dist_util.prefer_static_value(\n tf.shape(input=observed_time_series))[-2]\n lgssm = model.make_state_space_model(\n num_timesteps=num_timesteps, param_vals=parameter_samples)\n (_, _, _, _, _, observation_means, observation_covs\n ) = lgssm.forward_filter(observed_time_series, mask=is_missing)\n\n # Squeeze dims to convert from LGSSM's event shape `[num_timesteps, 1]`\n # to a scalar time series.\n return sts_util.mix_over_posterior_draws(\n means=observation_means[..., 0],\n variances=observation_covs[..., 0, 0])", "response": "Compute one-step predictive distributions for all timesteps."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\nconstruct a predictive distribution over future observations.", "response": "def 
forecast(model,\n observed_time_series,\n parameter_samples,\n num_steps_forecast):\n \"\"\"Construct predictive distribution over future observations.\n\n Given samples from the posterior over parameters, return the predictive\n distribution over future observations for num_steps_forecast timesteps.\n\n Args:\n model: An instance of `StructuralTimeSeries` representing a\n time-series model. This represents a joint distribution over\n time-series and their parameters with batch shape `[b1, ..., bN]`.\n observed_time_series: `float` `Tensor` of shape\n `concat([sample_shape, model.batch_shape, [num_timesteps, 1]])` where\n `sample_shape` corresponds to i.i.d. observations, and the trailing `[1]`\n dimension may (optionally) be omitted if `num_timesteps > 1`. May\n optionally be an instance of `tfp.sts.MaskedTimeSeries` including a\n mask `Tensor` to encode the locations of missing observations.\n parameter_samples: Python `list` of `Tensors` representing posterior samples\n of model parameters, with shapes `[concat([[num_posterior_draws],\n param.prior.batch_shape, param.prior.event_shape]) for param in\n model.parameters]`. 
This may optionally also be a map (Python `dict`) of\n parameter names to `Tensor` values.\n num_steps_forecast: scalar `int` `Tensor` number of steps to forecast.\n\n Returns:\n forecast_dist: a `tfd.MixtureSameFamily` instance with event shape\n [num_steps_forecast, 1] and batch shape\n `concat([sample_shape, model.batch_shape])`, with `num_posterior_draws`\n mixture components.\n\n #### Examples\n\n Suppose we've built a model and fit it to data using HMC:\n\n ```python\n day_of_week = tfp.sts.Seasonal(\n num_seasons=7,\n observed_time_series=observed_time_series,\n name='day_of_week')\n local_linear_trend = tfp.sts.LocalLinearTrend(\n observed_time_series=observed_time_series,\n name='local_linear_trend')\n model = tfp.sts.Sum(components=[day_of_week, local_linear_trend],\n observed_time_series=observed_time_series)\n\n samples, kernel_results = tfp.sts.fit_with_hmc(model, observed_time_series)\n ```\n\n Passing the posterior samples into `forecast`, we construct a forecast\n distribution:\n\n ```python\n forecast_dist = tfp.sts.forecast(model, observed_time_series,\n parameter_samples=samples,\n num_steps_forecast=50)\n\n forecast_mean = forecast_dist.mean()[..., 0] # shape: [50]\n forecast_scale = forecast_dist.stddev()[..., 0] # shape: [50]\n forecast_samples = forecast_dist.sample(10)[..., 0] # shape: [10, 50]\n ```\n\n If using variational inference instead of HMC, we'd construct a forecast using\n samples from the variational posterior:\n\n ```python\n (variational_loss,\n variational_distributions) = tfp.sts.build_factored_variational_loss(\n model=model, observed_time_series=observed_time_series)\n\n # OMITTED: take steps to optimize variational loss\n\n samples = {k: q.sample(30) for (k, q) in variational_distributions.items()}\n forecast_dist = tfp.sts.forecast(model, observed_time_series,\n parameter_samples=samples,\n num_steps_forecast=50)\n ```\n\n We can visualize the forecast by plotting:\n\n ```python\n from matplotlib import pylab as plt\n def 
plot_forecast(observed_time_series,\n forecast_mean,\n forecast_scale,\n forecast_samples):\n plt.figure(figsize=(12, 6))\n\n num_steps = observed_time_series.shape[-1]\n num_steps_forecast = forecast_mean.shape[-1]\n num_steps_train = num_steps - num_steps_forecast\n\n c1, c2 = (0.12, 0.47, 0.71), (1.0, 0.5, 0.05)\n plt.plot(np.arange(num_steps), observed_time_series,\n lw=2, color=c1, label='ground truth')\n\n forecast_steps = np.arange(num_steps_train,\n num_steps_train+num_steps_forecast)\n plt.plot(forecast_steps, forecast_samples.T, lw=1, color=c2, alpha=0.1)\n plt.plot(forecast_steps, forecast_mean, lw=2, ls='--', color=c2,\n label='forecast')\n plt.fill_between(forecast_steps,\n forecast_mean - 2 * forecast_scale,\n forecast_mean + 2 * forecast_scale, color=c2, alpha=0.2)\n\n plt.xlim([0, num_steps])\n plt.legend()\n\n plot_forecast(observed_time_series,\n forecast_mean=forecast_mean,\n forecast_scale=forecast_scale,\n forecast_samples=forecast_samples)\n ```\n\n \"\"\"\n\n with tf.compat.v1.name_scope(\n 'forecast',\n values=[observed_time_series, parameter_samples, num_steps_forecast]):\n [\n observed_time_series,\n mask\n ] = sts_util.canonicalize_observed_time_series_with_mask(\n observed_time_series)\n\n # Run filtering over the observed timesteps to extract the\n # latent state posterior at timestep T+1 (i.e., the final\n # filtering distribution, pushed through the transition model).\n # This is the prior for the forecast model (\"today's prior\n # is yesterday's posterior\").\n num_observed_steps = dist_util.prefer_static_value(\n tf.shape(input=observed_time_series))[-2]\n observed_data_ssm = model.make_state_space_model(\n num_timesteps=num_observed_steps, param_vals=parameter_samples)\n (_, _, _, predictive_means, predictive_covs, _, _\n ) = observed_data_ssm.forward_filter(observed_time_series, mask=mask)\n\n # Build a batch of state-space models over the forecast period. 
Because\n # we'll use MixtureSameFamily to mix over the posterior draws, we need to\n # do some shenanigans to move the `[num_posterior_draws]` batch dimension\n # from the leftmost to the rightmost side of the model's batch shape.\n # TODO(b/120245392): enhance `MixtureSameFamily` to reduce along an\n # arbitrary axis, and eliminate `move_dimension` calls here.\n parameter_samples = model._canonicalize_param_vals_as_map(parameter_samples) # pylint: disable=protected-access\n parameter_samples_with_reordered_batch_dimension = {\n param.name: dist_util.move_dimension(\n parameter_samples[param.name],\n 0, -(1 + _prefer_static_event_ndims(param.prior)))\n for param in model.parameters}\n forecast_prior = tfd.MultivariateNormalFullCovariance(\n loc=dist_util.move_dimension(predictive_means[..., -1, :], 0, -2),\n covariance_matrix=dist_util.move_dimension(\n predictive_covs[..., -1, :, :], 0, -3))\n\n # Ugly hack: because we moved `num_posterior_draws` to the trailing (rather\n # than leading) dimension of parameters, the parameter batch shapes no\n # longer broadcast against the `constant_offset` attribute used in `sts.Sum`\n # models. We fix this by manually adding an extra broadcasting dim to\n # `constant_offset` if present.\n # The root cause of this hack is that we mucked with param dimensions above\n # and are now passing params that are 'invalid' in the sense that they don't\n # match the shapes of the model's param priors. The fix (as above) will be\n # to update MixtureSameFamily so we can avoid changing param dimensions\n # altogether.\n # TODO(b/120245392): enhance `MixtureSameFamily` to reduce along an\n # arbitrary axis, and eliminate this hack.\n kwargs = {}\n if hasattr(model, 'constant_offset'):\n kwargs['constant_offset'] = tf.convert_to_tensor(\n value=model.constant_offset,\n dtype=forecast_prior.dtype)[..., tf.newaxis]\n\n # We assume that any STS model that has a `constant_offset` attribute\n # will allow it to be overridden as a kwarg. 
This is currently just\n    # `sts.Sum`.\n    # TODO(b/120245392): when kwargs hack is removed, switch back to calling\n    # the public version of `_make_state_space_model`.\n    forecast_ssm = model._make_state_space_model(  # pylint: disable=protected-access\n        num_timesteps=num_steps_forecast,\n        param_map=parameter_samples_with_reordered_batch_dimension,\n        initial_state_prior=forecast_prior,\n        initial_step=num_observed_steps,\n        **kwargs)\n\n    num_posterior_draws = dist_util.prefer_static_value(\n        forecast_ssm.batch_shape_tensor())[-1]\n    return tfd.MixtureSameFamily(\n        mixture_distribution=tfd.Categorical(\n            logits=tf.zeros([num_posterior_draws], dtype=forecast_ssm.dtype)),\n        components_distribution=forecast_ssm)"} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\nreturn max or mask if max is not finite.", "response": "def _max_mask_non_finite(x, axis=-1, keepdims=False, mask=0):\n  \"\"\"Returns `max` or `mask` if `max` is not finite.\"\"\"\n  m = np.max(x, axis=_astuple(axis), keepdims=keepdims)\n  needs_masking = ~np.isfinite(m)\n  if needs_masking.ndim > 0:\n    m[needs_masking] = mask\n  elif needs_masking:\n    m = mask\n  return m"} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\ncomputes `log(sum(exp(input_tensor)))` along the specified axis.", "response": "def _reduce_logsumexp(input_tensor, axis=None, keepdims=False, name=None):  # pylint: disable=unused-argument\n  \"\"\"Computes `log(sum(exp(input_tensor)))` along the specified axis.\"\"\"\n  try:\n    return scipy_special.logsumexp(\n        input_tensor, axis=_astuple(axis), keepdims=keepdims)\n  except NotImplementedError:\n    # We offer a pure-NumPy fallback in case SciPy isn't installed, since\n    # logsumexp is often used.\n    m = _max_mask_non_finite(input_tensor, axis=axis, keepdims=True)\n    y = input_tensor - m\n    y = np.exp(y, out=y)\n    return m + np.log(np.sum(y, axis=_astuple(axis), keepdims=keepdims))"} {"SOURCE": "codesearchnet", "instruction": 
"Create a Python 3 function to\nassert all elements of `x` are finite. Args: x: Numeric `Tensor`. data: The tensors to print out if the condition is False. Defaults to error message and first few entries of `x`. summarize: Print this many entries of each tensor. message: A string to prefix to the default message. name: A name for this operation (optional). Defaults to \"assert_finite\". Returns: Op raising `InvalidArgumentError` unless all elements of `x` are finite. If static checks determine all elements are finite, `x` is returned unchanged. Raises: ValueError: If static checks determine some element of `x` is not finite.", "response": "def assert_finite(x, data=None, summarize=None, message=None, name=None):\n  \"\"\"Assert all elements of `x` are finite.\n\n  Args:\n    x: Numeric `Tensor`.\n    data: The tensors to print out if the condition is False. Defaults to\n      error message and first few entries of `x`.\n    summarize: Print this many entries of each tensor.\n    message: A string to prefix to the default message.\n    name: A name for this operation (optional).\n      Defaults to \"assert_finite\".\n\n  Returns:\n    Op raising `InvalidArgumentError` unless all elements of `x` are finite.\n    If static checks determine all elements are finite, `x` is returned\n    unchanged.\n\n  Raises:\n    ValueError: If static checks determine some element of `x` is not finite.\n  \"\"\"\n  with tf.compat.v2.name_scope(name or 'assert_finite'):\n    x_ = tf.get_static_value(x)\n    if x_ is not None:\n      if not np.all(np.isfinite(x_)):\n        raise ValueError(message)\n      return x\n    assertion = tf.compat.v1.assert_equal(\n        tf.math.is_finite(x), tf.ones_like(x, tf.bool),\n        data=data, summarize=summarize, message=message)\n    with tf.control_dependencies([assertion]):\n      return tf.identity(x)"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef assert_rank_at_most(x, rank, data=None, summarize=None, message=None,\n                        name=None):\n  \"\"\"Assert `x` has rank equal to `rank` or smaller.\n\n  Example of adding a 
dependency to an operation:\n\n ```python\n with tf.control_dependencies([tf.assert_rank_at_most(x, 2)]):\n output = tf.reduce_sum(x)\n ```\n\n Args:\n x: Numeric `Tensor`.\n rank: Scalar `Tensor`.\n data: The tensors to print out if the condition is False. Defaults to\n error message and first few entries of `x`.\n summarize: Print this many entries of each tensor.\n message: A string to prefix to the default message.\n name: A name for this operation (optional).\n Defaults to \"assert_rank_at_most\".\n\n Returns:\n Op raising `InvalidArgumentError` unless `x` has specified rank or lower.\n If static checks determine `x` has correct rank, a `no_op` is returned.\n\n Raises:\n ValueError: If static checks determine `x` has wrong rank.\n \"\"\"\n with tf.compat.v2.name_scope(name or 'assert_rank_at_most'):\n return tf.compat.v1.assert_less_equal(\n tf.rank(x), rank, data=data, summarize=summarize, message=message)", "response": "A helper function to check that a tensor x has rank equal to rank or smaller."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\ncomputing the number of elements in a tensor with shape event_shape.", "response": "def _event_size(event_shape, name=None):\n \"\"\"Computes the number of elements in a tensor with shape `event_shape`.\n\n Args:\n event_shape: A tensor shape.\n name: The name to use for the tensor op to compute the number of elements\n (if such an op needs to be created).\n\n Returns:\n event_size: The number of elements in `tensor_shape`. Returns a numpy int\n when the number of elements can be computed immediately. 
Otherwise, returns\n      a scalar tensor.\n  \"\"\"\n  with tf.compat.v1.name_scope(name, 'event_size', [event_shape]):\n    event_shape = tf.convert_to_tensor(\n        value=event_shape, dtype=tf.int32, name='event_shape')\n\n    event_shape_const = tf.get_static_value(event_shape)\n    if event_shape_const is not None:\n      return np.prod(event_shape_const)\n    else:\n      return tf.reduce_prod(input_tensor=event_shape)"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _eval_all_one_hot(fn, dist, name=None):\n  with tf.compat.v1.name_scope(name, 'eval_all_one_hot'):\n    event_size = dist.event_shape_tensor()[-1]\n    batch_ndims = tf.size(input=dist.batch_shape_tensor())\n    # Reshape `eye(d)` to: `[d] + [1]*batch_ndims + [d]`.\n    x = tf.reshape(\n        tf.eye(event_size, dtype=dist.dtype),\n        shape=tf.pad(\n            tensor=tf.ones(batch_ndims, tf.int32),\n            paddings=[[1, 1]],\n            constant_values=event_size))\n    # Compute `fn(x)` then cyclically left-transpose one dim.\n    perm = tf.pad(tensor=tf.range(1, batch_ndims + 1), paddings=[[0, 1]])\n    return tf.transpose(a=fn(dist, x), perm=perm)", "response": "OneHotCategorical helper computing probs, cdf, etc. over its support."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _make_kl_divergence_fn(\n    distribution_b,\n    use_exact_kl=False,\n    test_points_reduce_axis=(),  # `None` == \"all\"; () == \"none\".\n    test_points_fn=tf.convert_to_tensor,\n    weight=None):\n  \"\"\"Creates a callable computing `KL[a,b]` from `a`, a `tfd.Distribution`.\"\"\"\n\n  if use_exact_kl:\n    kl_divergence_fn = tfd.kl_divergence\n  else:\n    # Closure over: test_points_fn, test_points_reduce_axis.\n    def kl_divergence_fn(distribution_a, distribution_b):\n      z = test_points_fn(distribution_a)\n      return tf.reduce_mean(\n          input_tensor=distribution_a.log_prob(z) - distribution_b.log_prob(z),\n          axis=test_points_reduce_axis)\n\n  # Closure over: distribution_b, kl_divergence_fn, weight.\n  
def _fn(distribution_a):\n    \"\"\"Closure that computes KLDiv as a function of `a` as in `KL[a, b]`.\"\"\"\n    with tf.compat.v1.name_scope('kldivergence_loss'):\n      # TODO(b/119756336): Due to eager/graph Jacobian graph caching bug\n      # we add here the capability for deferred construction of the prior.\n      # This capability can probably be removed once b/119756336 is resolved.\n      distribution_b_ = (distribution_b() if callable(distribution_b)\n                         else distribution_b)\n      kl = kl_divergence_fn(distribution_a, distribution_b_)\n      if weight is not None:\n        kl = tf.cast(weight, dtype=kl.dtype) * kl\n      # Losses appended via model.add_loss are expected to be a single\n      # scalar, unlike model.loss, which is expected to be the loss per sample.\n      # Therefore, we reduce over all dimensions, regardless of the shape.\n      # We take the sum because (apparently) Keras will add this to the *post*\n      # `reduce_sum` (total) loss.\n      # TODO(b/126259176): Add end-to-end Keras/TFP test to ensure the APIs\n      # align, particularly wrt how losses are aggregated (across batch\n      # members).\n      return tf.reduce_sum(input_tensor=kl, name='batch_total_kl_divergence')\n\n  return _fn", "response": "Creates a callable that computes `KL[a, b]` from `a`, given distribution `b`."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\nreturn a convert-to-tensor func given a name, config, callable, etc.", "response": "def _get_convert_to_tensor_fn(identifier):\n  \"\"\"Return a convert-to-tensor func, given a name, config, callable, etc.\"\"\"\n  if identifier is None:\n    return None\n\n  if isinstance(identifier, six.string_types):\n    identifier = str(identifier)\n    return _deserialize(identifier)\n\n  if isinstance(identifier, dict):\n    return _deserialize(identifier)\n\n  if isinstance(identifier, property):\n    identifier = identifier.fget\n  if callable(identifier):\n    return identifier\n\n  raise ValueError('Could not interpret '\n                   'convert-to-tensor function identifier:', identifier)"} {"SOURCE": 
"codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef get_config(self):\n config = {\n 'make_distribution_fn': _serialize_function(self._make_distribution_fn),\n 'convert_to_tensor_fn': _serialize(self._convert_to_tensor_fn),\n }\n base_config = super(DistributionLambda, self).get_config()\n return dict(list(base_config.items()) + list(config.items()))", "response": "Returns the config of this layer."} {"SOURCE": "codesearchnet", "instruction": "Can you create a Python 3 function that\ncreates a multivariate normal triangular distribution from a params vector.", "response": "def new(params, event_size, validate_args=False, name=None):\n \"\"\"Create the distribution instance from a `params` vector.\"\"\"\n with tf.compat.v1.name_scope(name, 'MultivariateNormalTriL',\n [params, event_size]):\n params = tf.convert_to_tensor(value=params, name='params')\n scale_tril = tfb.ScaleTriL(\n diag_shift=np.array(1e-5, params.dtype.as_numpy_dtype()),\n validate_args=validate_args)\n return tfd.MultivariateNormalTriL(\n loc=params[..., :event_size],\n scale_tril=scale_tril(params[..., event_size:]),\n validate_args=validate_args)"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef params_size(event_size, name=None):\n with tf.compat.v1.name_scope(name, 'MultivariateNormalTriL_params_size',\n [event_size]):\n return event_size + event_size * (event_size + 1) // 2", "response": "The number of params needed to create a single distribution."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function for\ncreating a OneHotCategorical distribution instance from a params vector.", "response": "def new(params, event_size, dtype=None, validate_args=False, name=None):\n \"\"\"Create the distribution instance from a `params` vector.\"\"\"\n with tf.compat.v1.name_scope(name, 'OneHotCategorical',\n [params, event_size]):\n return tfd.OneHotCategorical(\n logits=params,\n dtype=dtype or 
params.dtype.base_dtype,\n validate_args=validate_args)"} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\ncreate a distribution instance from a params vector.", "response": "def new(params, event_size, num_components,\n dtype=None, validate_args=False, name=None):\n \"\"\"Create the distribution instance from a `params` vector.\"\"\"\n with tf.compat.v1.name_scope(name, 'CategoricalMixtureOfOneHotCategorical',\n [params, event_size, num_components]):\n dist = MixtureSameFamily.new(\n params,\n num_components,\n OneHotCategorical(\n event_size,\n validate_args=False, # So we can eval on simplex interior.\n name=name),\n validate_args=validate_args,\n name=name)\n # pylint: disable=protected-access\n dist._mean = functools.partial(\n _eval_all_one_hot, tfd.Distribution.prob, dist)\n dist.log_mean = functools.partial(\n _eval_all_one_hot, tfd.Distribution.log_prob, dist)\n # pylint: enable=protected-access\n return dist"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\ncreating a new IndependentBernoulli distribution instance from a params vector.", "response": "def new(params, event_shape=(), dtype=None, validate_args=False, name=None):\n \"\"\"Create the distribution instance from a `params` vector.\"\"\"\n with tf.compat.v1.name_scope(name, 'IndependentBernoulli',\n [params, event_shape]):\n params = tf.convert_to_tensor(value=params, name='params')\n event_shape = dist_util.expand_to_vector(\n tf.convert_to_tensor(\n value=event_shape, name='event_shape', dtype_hint=tf.int32),\n tensor_name='event_shape')\n new_shape = tf.concat([\n tf.shape(input=params)[:-1],\n event_shape,\n ], axis=0)\n dist = tfd.Independent(\n tfd.Bernoulli(\n logits=tf.reshape(params, new_shape),\n dtype=dtype or params.dtype.base_dtype,\n validate_args=validate_args),\n reinterpreted_batch_ndims=tf.size(input=event_shape),\n validate_args=validate_args)\n dist._logits = dist.distribution._logits # pylint: disable=protected-access\n 
dist._probs = dist.distribution._probs # pylint: disable=protected-access\n dist.logits = tfd.Bernoulli.logits\n dist.probs = tfd.Bernoulli.probs\n return dist"} {"SOURCE": "codesearchnet", "instruction": "Can you create a Python 3 function that\nreturns the config of this Keras object.", "response": "def get_config(self):\n \"\"\"Returns the config of this layer.\n\n NOTE: At the moment, this configuration can only be serialized if the\n Layer's `convert_to_tensor_fn` is a serializable Keras object (i.e.,\n implements `get_config`) or one of the standard values:\n - `Distribution.sample` (or `\"sample\"`)\n - `Distribution.mean` (or `\"mean\"`)\n - `Distribution.mode` (or `\"mode\"`)\n - `Distribution.stddev` (or `\"stddev\"`)\n - `Distribution.variance` (or `\"variance\"`)\n \"\"\"\n config = {\n 'event_shape': self._event_shape,\n 'convert_to_tensor_fn': _serialize(self._convert_to_tensor_fn),\n 'sample_dtype': self._sample_dtype,\n 'validate_args': self._validate_args\n }\n base_config = super(IndependentBernoulli, self).get_config()\n return dict(list(base_config.items()) + list(config.items()))"} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\ncreate a new IndependentLogistic distribution from a params vector.", "response": "def new(params, event_shape=(), validate_args=False, name=None):\n \"\"\"Create the distribution instance from a `params` vector.\"\"\"\n with tf.compat.v1.name_scope(name, 'IndependentLogistic',\n [params, event_shape]):\n params = tf.convert_to_tensor(value=params, name='params')\n event_shape = dist_util.expand_to_vector(\n tf.convert_to_tensor(\n value=event_shape, name='event_shape', dtype_hint=tf.int32),\n tensor_name='event_shape')\n output_shape = tf.concat([\n tf.shape(input=params)[:-1],\n event_shape,\n ],\n axis=0)\n loc_params, scale_params = tf.split(params, 2, axis=-1)\n return tfd.Independent(\n tfd.Logistic(\n loc=tf.reshape(loc_params, output_shape),\n 
scale=tf.math.softplus(tf.reshape(scale_params, output_shape)),\n validate_args=validate_args),\n reinterpreted_batch_ndims=tf.size(input=event_shape),\n validate_args=validate_args)"} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\ncreates a new IndependentPoisson distribution instance from a params vector.", "response": "def new(params, event_shape=(), validate_args=False, name=None):\n \"\"\"Create the distribution instance from a `params` vector.\"\"\"\n with tf.compat.v1.name_scope(name, 'IndependentPoisson',\n [params, event_shape]):\n params = tf.convert_to_tensor(value=params, name='params')\n event_shape = dist_util.expand_to_vector(\n tf.convert_to_tensor(\n value=event_shape, name='event_shape', dtype_hint=tf.int32),\n tensor_name='event_shape')\n output_shape = tf.concat([\n tf.shape(input=params)[:-1],\n event_shape,\n ],\n axis=0)\n return tfd.Independent(\n tfd.Poisson(\n log_rate=tf.reshape(params, output_shape),\n validate_args=validate_args),\n reinterpreted_batch_ndims=tf.size(input=event_shape),\n validate_args=validate_args)"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef new(params, num_components, component_layer,\n validate_args=False, name=None):\n \"\"\"Create the distribution instance from a `params` vector.\"\"\"\n with tf.compat.v1.name_scope(name, 'MixtureSameFamily',\n [params, num_components, component_layer]):\n params = tf.convert_to_tensor(value=params, name='params')\n num_components = tf.convert_to_tensor(\n value=num_components, name='num_components', dtype_hint=tf.int32)\n\n components_dist = component_layer(\n tf.reshape(\n params[..., num_components:],\n tf.concat([tf.shape(input=params)[:-1], [num_components, -1]],\n axis=0)))\n mixture_dist = tfd.Categorical(logits=params[..., :num_components])\n return tfd.MixtureSameFamily(\n mixture_dist,\n components_dist,\n # TODO(b/120154797): Change following to 
`validate_args=True` after\n # fixing: \"ValueError: `mixture_distribution` must have scalar\n # `event_dim`s.\" assertion in MixtureSameFamily.\n validate_args=False)", "response": "Create a mixture - same - family distribution from a params vector."} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\nreturning the number of parameters needed to create a mixture of the same family.", "response": "def params_size(num_components, component_params_size, name=None):\n \"\"\"Number of `params` needed to create a `MixtureSameFamily` distribution.\n\n Arguments:\n num_components: Number of component distributions in the mixture\n distribution.\n component_params_size: Number of parameters needed to create a single\n component distribution.\n name: The name to use for the op to compute the number of parameters\n (if such an op needs to be created).\n\n Returns:\n params_size: The number of parameters needed to create the mixture\n distribution.\n \"\"\"\n with tf.compat.v1.name_scope(name, 'MixtureSameFamily_params_size',\n [num_components, component_params_size]):\n num_components = tf.convert_to_tensor(\n value=num_components, name='num_components', dtype_hint=tf.int32)\n component_params_size = tf.convert_to_tensor(\n value=component_params_size, name='component_params_size')\n\n num_components = dist_util.prefer_static_value(num_components)\n component_params_size = dist_util.prefer_static_value(\n component_params_size)\n\n return num_components + num_components * component_params_size"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef params_size(num_components, event_shape=(), name=None):\n return MixtureSameFamily.params_size(\n num_components,\n IndependentNormal.params_size(event_shape, name=name),\n name=name)", "response": "The number of params needed to create a single distribution."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\ncreating the 
distribution instance from a params vector.", "response": "def new(params, num_components, event_shape=(),\n validate_args=False, name=None):\n \"\"\"Create the distribution instance from a `params` vector.\"\"\"\n return MixtureSameFamily.new(\n params,\n num_components,\n IndependentLogistic(\n event_shape, validate_args=validate_args, name=name),\n validate_args=validate_args,\n name=name)"} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef get_next_interceptor():\n try:\n interceptor = _interceptor_stack.stack.pop()\n yield interceptor\n finally:\n _interceptor_stack.stack.append(interceptor)", "response": "Yields the next interceptor on the thread - local interceptor stack."} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\ngenerate synthetic data for binary classification.", "response": "def toy_logistic_data(num_examples, input_size=2, weights_prior_stddev=5.0):\n \"\"\"Generates synthetic data for binary classification.\n\n Args:\n num_examples: The number of samples to generate (scalar Python `int`).\n input_size: The input space dimension (scalar Python `int`).\n weights_prior_stddev: The prior standard deviation of the weight\n vector. 
(scalar Python `float`).\n\n Returns:\n random_weights: Sampled weights as a Numpy `array` of shape\n `[input_size]`.\n random_bias: Sampled bias as a scalar Python `float`.\n design_matrix: Points sampled uniformly from the cube `[-1,\n 1]^{input_size}`, as a Numpy `array` of shape `(num_examples,\n input_size)`.\n labels: Labels sampled from the logistic model `p(label=1) =\n logistic(dot(features, random_weights) + random_bias)`, as a Numpy\n `int32` `array` of shape `(num_examples, 1)`.\n \"\"\"\n random_weights = weights_prior_stddev * np.random.randn(input_size)\n random_bias = np.random.randn()\n design_matrix = np.random.rand(num_examples, input_size) * 2 - 1\n logits = np.reshape(\n np.dot(design_matrix, random_weights) + random_bias,\n (-1, 1))\n p_labels = 1. / (1 + np.exp(-logits))\n labels = np.int32(p_labels > np.random.rand(num_examples, 1))\n return random_weights, random_bias, np.float32(design_matrix), labels"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef build_input_pipeline(x, y, batch_size):\n training_dataset = tf.data.Dataset.from_tensor_slices((x, y))\n training_batches = training_dataset.repeat().batch(batch_size)\n training_iterator = tf.compat.v1.data.make_one_shot_iterator(training_batches)\n batch_features, batch_labels = training_iterator.get_next()\n return batch_features, batch_labels", "response": "Build a Dataset iterator for supervised classification."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\nvalidate map_values if validate_args == True.", "response": "def _maybe_check_valid_map_values(map_values, validate_args):\n \"\"\"Validate `map_values` if `validate_args`==True.\"\"\"\n assertions = []\n\n message = 'Rank of map_values must be 1.'\n if tensorshape_util.rank(map_values.shape) is not None:\n if tensorshape_util.rank(map_values.shape) != 1:\n raise ValueError(message)\n elif validate_args:\n assertions.append(assert_util.assert_rank(map_values, 1, 
message=message))\n\n message = 'Size of map_values must be greater than 0.'\n if tensorshape_util.num_elements(map_values.shape) is not None:\n if tensorshape_util.num_elements(map_values.shape) == 0:\n raise ValueError(message)\n elif validate_args:\n assertions.append(\n assert_util.assert_greater(\n tf.size(input=map_values), 0, message=message))\n\n if validate_args:\n assertions.append(\n assert_util.assert_equal(\n tf.math.is_strictly_increasing(map_values),\n True,\n message='map_values is not strictly increasing.'))\n\n return assertions"} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\ncalls a transition operator with args unpacking args if its a sequence.", "response": "def call_fn(fn: TransitionOperator, args: Union[Tuple[Any], Any]) -> Any:\n \"\"\"Calls a transition operator with args, unpacking args if its a sequence.\n\n Args:\n fn: A `TransitionOperator`.\n args: Arguments to `fn`\n\n Returns:\n ret: Return value of `fn`.\n \"\"\"\n\n if isinstance(args, (list, tuple)) and not mcmc_util.is_namedtuple_like(args):\n args = args # type: Tuple[Any]\n return fn(*args)\n else:\n return fn(args)"} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\ncalls fn and returns the gradients with respect to args.", "response": "def call_and_grads(fn: TransitionOperator, args: Union[Tuple[Any], Any]\n ) -> Tuple[tf.Tensor, TensorNest, TensorNest]:\n \"\"\"Calls `fn` and returns the gradients with respect to `fn`'s first output.\n\n Args:\n fn: A `TransitionOperator`.\n args: Arguments to `fn`\n\n Returns:\n ret: First output of `fn`.\n extra: Second output of `fn`.\n grads: Gradients of `ret` with respect to `args`.\n \"\"\"\n with tf.GradientTape() as tape:\n tape.watch(args)\n ret, extra = call_fn(fn, args)\n grads = tape.gradient(ret, args)\n return ret, extra, grads"} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\ntransforms 
a log-prob function using a bijector.", "response": "def transform_log_prob_fn(log_prob_fn: PotentialFn,\n                          bijector: BijectorNest,\n                          init_state: State = None\n                         ) -> Union[PotentialFn, Tuple[PotentialFn, State]]:\n  \"\"\"Transforms a log-prob function using a bijector.\n\n  This takes a log-prob function and creates a new log-prob function that now\n  takes state in the domain of the bijector, forward-transforms that state,\n  and calls the original log-prob function. It then returns the log-probability\n  that correctly accounts for this transformation.\n\n  The forward-transformed state is pre-pended to the original log-prob\n  function's extra returns and returned as the new extra return.\n\n  For convenience you can also pass the initial state (in the original space),\n  and this function will return the inverse-transformed state as the 2nd return\n  value. You'd use this to initialize MCMC operators that operate in the\n  transformed space.\n\n  Args:\n    log_prob_fn: Log prob fn.\n    bijector: Bijector(s), must be of the same structure as the `log_prob_fn`\n      inputs.\n    init_state: Initial state, in the original space.\n\n  Returns:\n    transformed_log_prob_fn: Transformed log prob fn.\n    transformed_init_state: If `init_state` is provided, the initial state in\n      the transformed space.\n  \"\"\"\n\n  def wrapper(*args):\n    \"\"\"Transformed wrapper.\"\"\"\n    bijector_ = bijector\n\n    args = tf.nest.map_structure(lambda x: 0. 
+ x, args)\n if len(args) == 1:\n args = args[0]\n elif isinstance(bijector_, list):\n bijector_ = tuple(bijector_)\n\n original_space_args = tf.nest.map_structure(lambda b, x: b.forward(x),\n bijector_, args)\n original_space_args = original_space_args # type: Tuple[Any]\n original_space_log_prob, extra = call_fn(log_prob_fn, original_space_args)\n event_ndims = tf.nest.map_structure(\n lambda x: tf.rank(x) - tf.rank(original_space_log_prob), args)\n\n return original_space_log_prob + sum(\n tf.nest.flatten(\n tf.nest.map_structure(\n lambda b, x, e: b.forward_log_det_jacobian(x, event_ndims=e),\n bijector_, args, event_ndims))), [original_space_args, extra]\n\n if init_state is None:\n return wrapper\n else:\n return wrapper, tf.nest.map_structure(lambda b, s: b.inverse(s), bijector,\n init_state)"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\nleapfrog `TransitionOperator`. Args: leapfrog_step_state: LeapFrogStepState. step_size: Step size, structure broadcastable to the `target_log_prob_fn` state. target_log_prob_fn: Target log prob fn. kinetic_energy_fn: Kinetic energy fn. Returns: leapfrog_step_state: LeapFrogStepState. 
leapfrog_step_extras: LeapFrogStepExtras.", "response": "def leapfrog_step(leapfrog_step_state: LeapFrogStepState,\n                  step_size: FloatTensor, target_log_prob_fn: PotentialFn,\n                  kinetic_energy_fn: PotentialFn\n                 ) -> Tuple[LeapFrogStepState, LeapFrogStepExtras]:\n  \"\"\"Leapfrog `TransitionOperator`.\n\n  Args:\n    leapfrog_step_state: LeapFrogStepState.\n    step_size: Step size, structure broadcastable to the `target_log_prob_fn`\n      state.\n    target_log_prob_fn: Target log prob fn.\n    kinetic_energy_fn: Kinetic energy fn.\n\n  Returns:\n    leapfrog_step_state: LeapFrogStepState.\n    leapfrog_step_extras: LeapFrogStepExtras.\n  \"\"\"\n  state = leapfrog_step_state.state\n  state_grads = leapfrog_step_state.state_grads\n  momentum = leapfrog_step_state.momentum\n  step_size = maybe_broadcast_structure(step_size, state)\n\n  state = tf.nest.map_structure(tf.convert_to_tensor, state)\n  momentum = tf.nest.map_structure(tf.convert_to_tensor, momentum)\n\n  if state_grads is None:\n    _, _, state_grads = call_and_grads(target_log_prob_fn, state)\n  else:\n    state_grads = tf.nest.map_structure(tf.convert_to_tensor, state_grads)\n\n  momentum = tf.nest.map_structure(lambda m, sg, s: m + 0.5 * sg * s, momentum,\n                                   state_grads, step_size)\n\n  kinetic_energy, kinetic_energy_extra, momentum_grads = call_and_grads(\n      kinetic_energy_fn, momentum)\n\n  state = tf.nest.map_structure(lambda x, mg, s: x + mg * s, state,\n                                momentum_grads, step_size)\n\n  target_log_prob, state_extra, state_grads = call_and_grads(\n      target_log_prob_fn, state)\n\n  momentum = tf.nest.map_structure(lambda m, sg, s: m + 0.5 * sg * s, momentum,\n                                   state_grads, step_size)\n\n  return LeapFrogStepState(state, state_grads, momentum), LeapFrogStepExtras(\n      target_log_prob, state_extra, kinetic_energy, kinetic_energy_extra)"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef metropolis_hastings_step(current_state: State,\n                             
proposed_state: State,\n energy_change: FloatTensor,\n seed=None) -> Tuple[State, tf.Tensor, tf.Tensor]:\n \"\"\"Metropolis-Hastings step.\n\n This probabilistically chooses between `current_state` and `proposed_state`\n based on the `energy_change` so as to preserve detailed balance.\n\n Energy change is the negative of `log_accept_ratio`.\n\n Args:\n current_state: Current state.\n proposed_state: Proposed state.\n energy_change: E(proposed_state) - E(previous_state).\n seed: For reproducibility.\n\n Returns:\n new_state: The chosen state.\n is_accepted: Whether the proposed state was accepted.\n log_uniform: The random number that was used to select between the two\n states.\n \"\"\"\n flat_current = tf.nest.flatten(current_state)\n flat_proposed = nest.flatten_up_to(current_state, proposed_state)\n # Impute the None's in the current state.\n flat_current = [\n p if c is None else c for p, c in zip(flat_proposed, flat_current)\n ]\n current_state = tf.nest.pack_sequence_as(current_state, flat_current)\n\n current_state = tf.nest.map_structure(tf.convert_to_tensor, current_state)\n proposed_state = tf.nest.map_structure(tf.convert_to_tensor, proposed_state)\n energy_change = tf.convert_to_tensor(value=energy_change)\n\n log_accept_ratio = -energy_change\n\n log_uniform = tf.math.log(\n tf.random.uniform(\n shape=tf.shape(input=log_accept_ratio),\n dtype=log_accept_ratio.dtype.base_dtype,\n seed=seed))\n is_accepted = log_uniform < log_accept_ratio\n\n next_state = mcmc_util.choose(\n is_accepted, proposed_state, current_state, name='choose_next_state')\n return next_state, is_accepted, log_uniform", "response": "Metropolis-Hastings step."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef sign_adaptation(control: FloatNest,\n output: FloatTensor,\n set_point: FloatTensor,\n adaptation_rate: FloatTensor = 0.01) -> FloatNest:\n \"\"\"A function to do simple sign-based control of a variable.\n\n ```\n 
control = control * (1. + adaptation_rate) ** sign(output - set_point)\n ```\n\n Args:\n control: The control variable.\n output: The output variable.\n set_point: The set point for `output`. This function will adjust `control`\n so that `output` matches `set_point`.\n adaptation_rate: Adaptation rate.\n\n Returns:\n control: New control.\n \"\"\"\n\n def _get_new_control(control, output, set_point):\n new_control = mcmc_util.choose(output > set_point,\n control * (1. + adaptation_rate),\n control / (1. + adaptation_rate))\n return new_control\n\n output = maybe_broadcast_structure(output, control)\n set_point = maybe_broadcast_structure(set_point, control)\n\n return tf.nest.map_structure(_get_new_control, control, output, set_point)", "response": "A function to do simple sign - based adaptation of a variable."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef compute_output_shape(self, input_shape):\n input_shape = tf.TensorShape(input_shape).as_list()\n if self.data_format == 'channels_last':\n space = input_shape[1:-1]\n new_space = []\n for i in range(len(space)):\n new_dim = tf_layers_util.conv_output_length(\n space[i],\n self.kernel_size[i],\n padding=self.padding,\n stride=self.strides[i],\n dilation=self.dilation_rate[i])\n new_space.append(new_dim)\n return tf.TensorShape([input_shape[0]] + new_space + [self.filters])\n else:\n space = input_shape[2:]\n new_space = []\n for i in range(len(space)):\n new_dim = tf_layers_util.conv_output_length(\n space[i],\n self.kernel_size[i],\n padding=self.padding,\n stride=self.strides[i],\n dilation=self.dilation_rate[i])\n new_space.append(new_dim)\n return tf.TensorShape([input_shape[0], self.filters] + new_space)", "response": "Computes the output shape of the layer."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\nreturning the config of the current layer.", "response": "def get_config(self):\n \"\"\"Returns the config of the 
layer.\n\n A layer config is a Python dictionary (serializable) containing the\n configuration of a layer. The same layer can be reinstantiated later\n (without its trained weights) from this configuration.\n\n Returns:\n config: A Python dictionary of class keyword arguments and their\n serialized values.\n \"\"\"\n config = {\n 'filters': self.filters,\n 'kernel_size': self.kernel_size,\n 'strides': self.strides,\n 'padding': self.padding,\n 'data_format': self.data_format,\n 'dilation_rate': self.dilation_rate,\n 'activation': (tf.keras.activations.serialize(self.activation)\n if self.activation else None),\n 'activity_regularizer':\n tf.keras.initializers.serialize(self.activity_regularizer),\n }\n function_keys = [\n 'kernel_posterior_fn',\n 'kernel_posterior_tensor_fn',\n 'kernel_prior_fn',\n 'kernel_divergence_fn',\n 'bias_posterior_fn',\n 'bias_posterior_tensor_fn',\n 'bias_prior_fn',\n 'bias_divergence_fn',\n ]\n for function_key in function_keys:\n function = getattr(self, function_key)\n if function is None:\n function_name = None\n function_type = None\n else:\n function_name, function_type = tfp_layers_util.serialize_function(\n function)\n config[function_key] = function_name\n config[function_key + '_type'] = function_type\n base_config = super(_ConvVariational, self).get_config()\n return dict(list(base_config.items()) + list(config.items()))"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\ncreate a new node from its config dictionary.", "response": "def from_config(cls, config):\n \"\"\"Creates a layer from its config.\n\n This method is the reverse of `get_config`, capable of instantiating the\n same layer from the config dictionary.\n\n Args:\n config: A Python dictionary, typically the output of `get_config`.\n\n Returns:\n layer: A layer instance.\n \"\"\"\n config = config.copy()\n function_keys = [\n 'kernel_posterior_fn',\n 'kernel_posterior_tensor_fn',\n 'kernel_prior_fn',\n 'kernel_divergence_fn',\n 
'bias_posterior_fn',\n 'bias_posterior_tensor_fn',\n 'bias_prior_fn',\n 'bias_divergence_fn',\n ]\n for function_key in function_keys:\n serial = config[function_key]\n function_type = config.pop(function_key + '_type')\n if serial is not None:\n config[function_key] = tfp_layers_util.deserialize_function(\n serial,\n function_type=function_type)\n return cls(**config)"} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef get_config(self):\n config = {\n 'seed': self.seed,\n }\n base_config = super(_ConvFlipout, self).get_config()\n return dict(list(base_config.items()) + list(config.items()))", "response": "Returns the config of the layer."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _create_scale_operator(self, identity_multiplier, diag, tril,\n perturb_diag, perturb_factor, shift, validate_args,\n dtype):\n \"\"\"Construct `scale` from various components.\n\n Args:\n identity_multiplier: floating point rank 0 `Tensor` representing a scaling\n done to the identity matrix.\n diag: Floating-point `Tensor` representing the diagonal matrix.`diag` has\n shape `[N1, N2, ... k]`, which represents a k x k diagonal matrix.\n tril: Floating-point `Tensor` representing the lower triangular matrix.\n `tril` has shape `[N1, N2, ... k, k]`, which represents a k x k lower\n triangular matrix.\n perturb_diag: Floating-point `Tensor` representing the diagonal matrix of\n the low rank update.\n perturb_factor: Floating-point `Tensor` representing factor matrix.\n shift: Floating-point `Tensor` representing `shift in `scale @ X + shift`.\n validate_args: Python `bool` indicating whether arguments should be\n checked for correctness.\n dtype: `DType` for arg `Tensor` conversions.\n\n Returns:\n scale. In the case of scaling by a constant, scale is a\n floating point `Tensor`. 
Otherwise, scale is a `LinearOperator`.\n\n Raises:\n ValueError: if all of `tril`, `diag` and `identity_multiplier` are `None`.\n \"\"\"\n identity_multiplier = _as_tensor(identity_multiplier, \"identity_multiplier\",\n dtype)\n diag = _as_tensor(diag, \"diag\", dtype)\n tril = _as_tensor(tril, \"tril\", dtype)\n perturb_diag = _as_tensor(perturb_diag, \"perturb_diag\", dtype)\n perturb_factor = _as_tensor(perturb_factor, \"perturb_factor\", dtype)\n\n # If possible, use the low rank update to infer the shape of\n # the identity matrix, when scale represents a scaled identity matrix\n # with a low rank update.\n shape_hint = None\n if perturb_factor is not None:\n shape_hint = distribution_util.dimension_size(perturb_factor, axis=-2)\n\n if self._is_only_identity_multiplier:\n if validate_args:\n return distribution_util.with_dependencies([\n assert_util.assert_none_equal(\n identity_multiplier, tf.zeros([], identity_multiplier.dtype),\n [\"identity_multiplier should be non-zero.\"])\n ], identity_multiplier)\n return identity_multiplier\n\n scale = distribution_util.make_tril_scale(\n loc=shift,\n scale_tril=tril,\n scale_diag=diag,\n scale_identity_multiplier=identity_multiplier,\n validate_args=validate_args,\n assert_positive=False,\n shape_hint=shape_hint)\n\n if perturb_factor is not None:\n return tf.linalg.LinearOperatorLowRankUpdate(\n scale,\n u=perturb_factor,\n diag_update=perturb_diag,\n is_diag_update_positive=perturb_diag is None,\n is_non_singular=True, # Implied by is_positive_definite=True.\n is_self_adjoint=True,\n is_positive_definite=True,\n is_square=True)\n\n return scale", "response": "Constructs a LinearOperator object for scaling a set of identity matrices."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef random_walk_normal_fn(scale=1., name=None):\n def _fn(state_parts, seed):\n \"\"\"Adds a normal perturbation to the input state.\n\n Args:\n state_parts: A list of `Tensor`s of 
any shape and real dtype representing\n the state parts of the `current_state` of the Markov chain.\n seed: `int` or None. The random seed for this `Op`. If `None`, no seed is\n applied.\n Default value: `None`.\n\n Returns:\n perturbed_state_parts: A Python `list` of The `Tensor`s. Has the same\n shape and type as the `state_parts`.\n\n Raises:\n ValueError: if `scale` does not broadcast with `state_parts`.\n \"\"\"\n with tf.compat.v1.name_scope(\n name, 'random_walk_normal_fn', values=[state_parts, scale, seed]):\n scales = scale if mcmc_util.is_list_like(scale) else [scale]\n if len(scales) == 1:\n scales *= len(state_parts)\n if len(state_parts) != len(scales):\n raise ValueError('`scale` must broadcast with `state_parts`.')\n seed_stream = distributions.SeedStream(seed, salt='RandomWalkNormalFn')\n next_state_parts = [\n tf.random.normal(\n mean=state_part,\n stddev=scale_part,\n shape=tf.shape(input=state_part),\n dtype=state_part.dtype.base_dtype,\n seed=seed_stream())\n for scale_part, state_part in zip(scales, state_parts)\n ]\n\n return next_state_parts\n return _fn", "response": "Returns a function that adds a normal perturbation to the state parts of the Markov chain."} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef random_walk_uniform_fn(scale=1., name=None):\n def _fn(state_parts, seed):\n \"\"\"Adds a uniform perturbation to the input state.\n\n Args:\n state_parts: A list of `Tensor`s of any shape and real dtype representing\n the state parts of the `current_state` of the Markov chain.\n seed: `int` or None. The random seed for this `Op`. If `None`, no seed is\n applied.\n Default value: `None`.\n\n Returns:\n perturbed_state_parts: A Python `list` of The `Tensor`s. 
Has the same\n shape and type as the `state_parts`.\n\n Raises:\n ValueError: if `scale` does not broadcast with `state_parts`.\n \"\"\"\n with tf.compat.v1.name_scope(\n name, 'random_walk_uniform_fn', values=[state_parts, scale, seed]):\n scales = scale if mcmc_util.is_list_like(scale) else [scale]\n if len(scales) == 1:\n scales *= len(state_parts)\n if len(state_parts) != len(scales):\n raise ValueError('`scale` must broadcast with `state_parts`.')\n seed_stream = distributions.SeedStream(seed, salt='RandomWalkUniformFn')\n next_state_parts = [\n tf.random.uniform(\n minval=state_part - scale_part,\n maxval=state_part + scale_part,\n shape=tf.shape(input=state_part),\n dtype=state_part.dtype.base_dtype,\n seed=seed_stream())\n for scale_part, state_part in zip(scales, state_parts)\n ]\n return next_state_parts\n return _fn", "response": "Returns a function that adds a random uniform perturbation to the input."} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\nbatch KL divergence `KL(a || b)` for Independent distributions. We can leverage the fact that ``` KL(Independent(a) || Independent(b)) = sum(KL(a || b)) ``` where the sum is over the `reinterpreted_batch_ndims`. Args: a: Instance of `Independent`. b: Instance of `Independent`. name: (optional) name to use for created ops. Default \"kl_independent\". Returns: Batchwise `KL(a || b)`. Raises: ValueError: If the event space for `a` and `b`, or their underlying distributions don't match.", "response": "def _kl_independent(a, b, name=\"kl_independent\"):\n \"\"\"Batched KL divergence `KL(a || b)` for Independent distributions.\n\n We can leverage the fact that\n ```\n KL(Independent(a) || Independent(b)) = sum(KL(a || b))\n ```\n where the sum is over the `reinterpreted_batch_ndims`.\n\n Args:\n a: Instance of `Independent`.\n b: Instance of `Independent`.\n name: (optional) name to use for created ops. 
Default \"kl_independent\".\n\n Returns:\n Batchwise `KL(a || b)`.\n\n Raises:\n ValueError: If the event space for `a` and `b`, or their underlying\n distributions don't match.\n \"\"\"\n p = a.distribution\n q = b.distribution\n\n # The KL between any two (non)-batched distributions is a scalar.\n # Given that the KL between two factored distributions is the sum, i.e.\n # KL(p1(x)p2(y) || q1(x)q2(y)) = KL(p1 || q1) + KL(q1 || q2), we compute\n # KL(p || q) and do a `reduce_sum` on the reinterpreted batch dimensions.\n if (tensorshape_util.is_fully_defined(a.event_shape) and\n tensorshape_util.is_fully_defined(b.event_shape)):\n if a.event_shape == b.event_shape:\n if p.event_shape == q.event_shape:\n num_reduce_dims = (tensorshape_util.rank(a.event_shape) -\n tensorshape_util.rank(p.event_shape))\n reduce_dims = [-i - 1 for i in range(0, num_reduce_dims)]\n\n return tf.reduce_sum(\n input_tensor=kullback_leibler.kl_divergence(p, q, name=name),\n axis=reduce_dims)\n else:\n raise NotImplementedError(\"KL between Independents with different \"\n \"event shapes not supported.\")\n else:\n raise ValueError(\"Event shapes do not match.\")\n else:\n with tf.control_dependencies(\n [\n assert_util.assert_equal(a.event_shape_tensor(),\n b.event_shape_tensor()),\n assert_util.assert_equal(p.event_shape_tensor(),\n q.event_shape_tensor())\n ]):\n num_reduce_dims = (\n prefer_static.rank_from_shape(\n a.event_shape_tensor, a.event_shape) -\n prefer_static.rank_from_shape(\n p.event_shape_tensor, a.event_shape))\n reduce_dims = prefer_static.range(-num_reduce_dims - 1, -1, 1)\n return tf.reduce_sum(\n input_tensor=kullback_leibler.kl_divergence(p, q, name=name),\n axis=reduce_dims)"} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\ncomputes the default value for reinterpreted_batch_ndim __init__ arg.", "response": "def _get_default_reinterpreted_batch_ndims(self, distribution):\n \"\"\"Computes the default value for 
reinterpreted_batch_ndim __init__ arg.\"\"\"\n ndims = prefer_static.rank_from_shape(\n distribution.batch_shape_tensor, distribution.batch_shape)\n return prefer_static.maximum(0, ndims - 1)"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _expand_to_event_rank(self, x):\n expanded_x = x\n for _ in range(tensorshape_util.rank(self.event_shape)):\n expanded_x = tf.expand_dims(expanded_x, -1)\n return expanded_x", "response": "Expand the rank of x up to static_event_rank times for broadcasting."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef entropy_lower_bound(self, name=\"entropy_lower_bound\"):\n with self._name_scope(name):\n with tf.control_dependencies(self._assertions):\n distribution_entropies = [d.entropy() for d in self.components]\n cat_probs = self._cat_probs(log_probs=False)\n partial_entropies = [\n c_p * m for (c_p, m) in zip(cat_probs, distribution_entropies)\n ]\n # These are all the same shape by virtue of matching batch_shape\n return tf.add_n(partial_entropies)", "response": "A lower bound on the mixture's entropy."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _cat_probs(self, log_probs):\n which_softmax = tf.nn.log_softmax if log_probs else tf.nn.softmax\n cat_probs = which_softmax(self.cat.logits)\n cat_probs = tf.unstack(cat_probs, num=self.num_components, axis=-1)\n return cat_probs", "response": "Get a list of num_components batchwise probabilities."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _maybe_validate_args(outcomes, logits, probs, validate_args):\n assertions = []\n\n def validate_equal_last_dim(tensor_a, tensor_b, message):\n if tensor_a.shape.is_fully_defined() and tensor_b.shape.is_fully_defined():\n if tensor_a.shape[-1] != tensor_b.shape[-1]:\n raise ValueError(message)\n elif 
validate_args:\n assertions.append(\n tf.compat.v1.assert_equal(\n tf.shape(input=tensor_a)[-1],\n tf.shape(input=tensor_b)[-1],\n message=message))\n\n if logits is not None:\n validate_equal_last_dim(\n outcomes,\n logits,\n message='Last dimension of outcomes and logits must be equal size.')\n if probs is not None:\n validate_equal_last_dim(\n outcomes,\n probs,\n message='Last dimension of outcomes and probs must be equal size.')\n\n message = 'Rank of outcomes must be 1.'\n if outcomes.shape.ndims is not None:\n if outcomes.shape.ndims != 1:\n raise ValueError(message)\n elif validate_args:\n assertions.append(tf.compat.v1.assert_rank(outcomes, 1, message=message))\n\n message = 'Size of outcomes must be greater than 0.'\n if outcomes.shape.num_elements() is not None:\n if outcomes.shape.num_elements() == 0:\n raise ValueError(message)\n elif validate_args:\n assertions.append(\n tf.compat.v1.assert_greater(\n tf.size(input=outcomes), 0, message=message))\n\n if validate_args:\n assertions.append(\n tf.compat.v1.assert_equal(\n tf.math.is_strictly_increasing(outcomes),\n True,\n message='outcomes is not strictly increasing.'))\n\n return assertions", "response": "Validate outcomes logits and probs."} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\nattempts to import tensorflow and ensure its version is sufficient.", "response": "def _ensure_tf_install(): # pylint: disable=g-statement-before-imports\n \"\"\"Attempt to import tensorflow, and ensure its version is sufficient.\n\n Raises:\n ImportError: if either tensorflow is not importable or its version is\n inadequate.\n \"\"\"\n try:\n import tensorflow as tf\n except ImportError:\n # Print more informative error message, then reraise.\n print(\"\\n\\nFailed to import TensorFlow. Please note that TensorFlow is not \"\n \"installed by default when you install TensorFlow Probability. 
This \"\n \"is so that users can decide whether to install the GPU-enabled \"\n \"TensorFlow package. To use TensorFlow Probability, please install \"\n \"the most recent version of TensorFlow, by following instructions at \"\n \"https://tensorflow.org/install.\\n\\n\")\n raise\n\n import distutils.version\n\n #\n # Update this whenever we need to depend on a newer TensorFlow release.\n #\n required_tensorflow_version = \"1.13\"\n\n if (distutils.version.LooseVersion(tf.__version__) <\n distutils.version.LooseVersion(required_tensorflow_version)):\n raise ImportError(\n \"This version of TensorFlow Probability requires TensorFlow \"\n \"version >= {required}; Detected an installation of version {present}. \"\n \"Please upgrade TensorFlow to proceed.\".format(\n required=required_tensorflow_version,\n present=tf.__version__))"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef covertype():\n import sklearn.datasets # pylint: disable=g-import-not-at-top\n data = sklearn.datasets.covtype.fetch_covtype()\n features = data.data\n labels = data.target\n\n # Normalize features and append a column of ones for the intercept.\n features -= features.mean(0)\n features /= features.std(0)\n features = np.hstack([features, np.ones([features.shape[0], 1])])\n features = tf.cast(features, dtype=tf.float32)\n\n # Binarize outcomes on whether it is a specific category.\n _, counts = np.unique(labels, return_counts=True)\n specific_category = np.argmax(counts)\n labels = (labels == specific_category)\n labels = tf.cast(labels, dtype=tf.int32)\n return features, labels", "response": "Builds the Covertype data set."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _kl_dirichlet_dirichlet(d1, d2, name=None):\n with tf.name_scope(name or \"kl_dirichlet_dirichlet\"):\n # The KL between Dirichlet distributions can be derived as follows. 
We have\n #\n # Dir(x; a) = 1 / B(a) * prod_i[x[i]^(a[i] - 1)]\n #\n # where B(a) is the multivariate Beta function:\n #\n # B(a) = Gamma(a[1]) * ... * Gamma(a[n]) / Gamma(a[1] + ... + a[n])\n #\n # The KL is\n #\n # KL(Dir(x; a), Dir(x; b)) = E_Dir(x; a){log(Dir(x; a) / Dir(x; b))}\n #\n # so we'll need to know the log density of the Dirichlet. This is\n #\n # log(Dir(x; a)) = sum_i[(a[i] - 1) log(x[i])] - log B(a)\n #\n # The only term that matters for the expectations is the log(x[i]). To\n # compute the expectation of this term over the Dirichlet density, we can\n # use the following facts about the Dirichlet in exponential family form:\n # 1. log(x[i]) is a sufficient statistic\n # 2. expected sufficient statistics (of any exp family distribution) are\n # equal to derivatives of the log normalizer with respect to\n # corresponding natural parameters: E{T[i](x)} = dA/d(eta[i])\n #\n # To proceed, we can rewrite the Dirichlet density in exponential family\n # form as follows:\n #\n # Dir(x; a) = exp{eta(a) . T(x) - A(a)}\n #\n # where '.' 
is the dot product of vectors eta and T, and A is a scalar:\n #\n # eta[i](a) = a[i] - 1\n # T[i](x) = log(x[i])\n # A(a) = log B(a)\n #\n # Now, we can use fact (2) above to write\n #\n # E_Dir(x; a)[log(x[i])]\n # = dA(a) / da[i]\n # = d/da[i] log B(a)\n # = d/da[i] (sum_j lgamma(a[j])) - lgamma(sum_j a[j])\n # = digamma(a[i])) - digamma(sum_j a[j])\n #\n # Putting it all together, we have\n #\n # KL[Dir(x; a) || Dir(x; b)]\n # = E_Dir(x; a){log(Dir(x; a) / Dir(x; b)}\n # = E_Dir(x; a){sum_i[(a[i] - b[i]) log(x[i])} - (lbeta(a) - lbeta(b))\n # = sum_i[(a[i] - b[i]) * E_Dir(x; a){log(x[i])}] - lbeta(a) + lbeta(b)\n # = sum_i[(a[i] - b[i]) * (digamma(a[i]) - digamma(sum_j a[j]))]\n # - lbeta(a) + lbeta(b))\n\n digamma_sum_d1 = tf.math.digamma(\n tf.reduce_sum(input_tensor=d1.concentration, axis=-1, keepdims=True))\n digamma_diff = tf.math.digamma(d1.concentration) - digamma_sum_d1\n concentration_diff = d1.concentration - d2.concentration\n\n return (\n tf.reduce_sum(input_tensor=concentration_diff * digamma_diff, axis=-1) -\n tf.math.lbeta(d1.concentration) + tf.math.lbeta(d2.concentration))", "response": "Batchwise KL divergence between two Dirichlet distributions."} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\nchecking the validity of the concentration parameter.", "response": "def _maybe_assert_valid_concentration(self, concentration, validate_args):\n \"\"\"Checks the validity of the concentration parameter.\"\"\"\n if not validate_args:\n return concentration\n return distribution_util.with_dependencies([\n assert_util.assert_positive(\n concentration, message=\"Concentration parameter must be positive.\"),\n assert_util.assert_rank_at_least(\n concentration,\n 1,\n message=\"Concentration parameter must have >=1 dimensions.\"),\n assert_util.assert_less(\n 1,\n tf.shape(input=concentration)[-1],\n message=\"Concentration parameter must have event_size >= 2.\"),\n ], concentration)"} {"SOURCE": "codesearchnet", 
"instruction": "How would you code a function in Python 3 to\ncheck the validity of a sample.", "response": "def _maybe_assert_valid_sample(self, x):\n \"\"\"Checks the validity of a sample.\"\"\"\n if not self.validate_args:\n return x\n return distribution_util.with_dependencies([\n assert_util.assert_positive(x, message=\"samples must be positive\"),\n assert_util.assert_near(\n tf.ones([], dtype=self.dtype),\n tf.reduce_sum(input_tensor=x, axis=-1),\n message=\"sample last-dimension must sum to `1`\"),\n ], x)"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef cholesky_covariance(x, sample_axis=0, keepdims=False, name=None):\n with tf.compat.v1.name_scope(\n name, 'cholesky_covariance', values=[x, sample_axis]):\n sample_axis = tf.convert_to_tensor(value=sample_axis, dtype=tf.int32)\n cov = covariance(\n x, sample_axis=sample_axis, event_axis=-1, keepdims=keepdims)\n return tf.linalg.cholesky(cov)", "response": "Computes the Cholesky factor of the covariance matrix of vector-variate random samples."} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef covariance(x,\n y=None,\n sample_axis=0,\n event_axis=-1,\n keepdims=False,\n name=None):\n \"\"\"Sample covariance between observations indexed by `event_axis`.\n\n Given `N` samples of scalar random variables `X` and `Y`, covariance may be\n estimated as\n\n ```none\n Cov[X, Y] := N^{-1} sum_{n=1}^N (X_n - Xbar) Conj{(Y_n - Ybar)}\n Xbar := N^{-1} sum_{n=1}^N X_n\n Ybar := N^{-1} sum_{n=1}^N Y_n\n ```\n\n For vector-variate random variables `X = (X1, ..., Xd)`, `Y = (Y1, ..., Yd)`,\n one is often interested in the covariance matrix, `C_{ij} := Cov[Xi, Yj]`.\n\n ```python\n x = tf.random_normal(shape=(100, 2, 3))\n y = tf.random_normal(shape=(100, 2, 3))\n\n # cov[i, j] is the sample covariance between x[:, i, j] and y[:, i, j].\n cov = tfp.stats.covariance(x, y, sample_axis=0, event_axis=None)\n\n # 
cov_matrix[i, m, n] is the sample covariance of x[:, i, m] and y[:, i, n]\n cov_matrix = tfp.stats.covariance(x, y, sample_axis=0, event_axis=-1)\n ```\n\n Notice we divide by `N` (the numpy default), which does not create `NaN`\n when `N = 1`, but is slightly biased.\n\n Args:\n x: A numeric `Tensor` holding samples.\n y: Optional `Tensor` with same `dtype` and `shape` as `x`.\n Default value: `None` (`y` is effectively set to `x`).\n sample_axis: Scalar or vector `Tensor` designating axis holding samples, or\n `None` (meaning all axis hold samples).\n Default value: `0` (leftmost dimension).\n event_axis: Scalar or vector `Tensor`, or `None` (scalar events).\n Axis indexing random events, whose covariance we are interested in.\n If a vector, entries must form a contiguous block of dims. `sample_axis`\n and `event_axis` should not intersect.\n Default value: `-1` (rightmost axis holds events).\n keepdims: Boolean. Whether to keep the sample axis as singletons.\n name: Python `str` name prefixed to Ops created by this function.\n Default value: `None` (i.e., `'covariance'`).\n\n Returns:\n cov: A `Tensor` of same `dtype` as the `x`, and rank equal to\n `rank(x) - len(sample_axis) + 2 * len(event_axis)`.\n\n Raises:\n AssertionError: If `x` and `y` are found to have different shape.\n ValueError: If `sample_axis` and `event_axis` are found to overlap.\n ValueError: If `event_axis` is found to not be contiguous.\n \"\"\"\n\n with tf.compat.v1.name_scope(\n name, 'covariance', values=[x, y, event_axis, sample_axis]):\n x = tf.convert_to_tensor(value=x, name='x')\n # Covariance *only* uses the centered versions of x (and y).\n x -= tf.reduce_mean(input_tensor=x, axis=sample_axis, keepdims=True)\n\n if y is None:\n y = x\n else:\n y = tf.convert_to_tensor(value=y, name='y', dtype=x.dtype)\n # If x and y have different shape, sample_axis and event_axis will likely\n # be wrong for one of them!\n x.shape.assert_is_compatible_with(y.shape)\n y -= 
tf.reduce_mean(input_tensor=y, axis=sample_axis, keepdims=True)\n\n if event_axis is None:\n return tf.reduce_mean(\n input_tensor=x * tf.math.conj(y), axis=sample_axis, keepdims=keepdims)\n\n if sample_axis is None:\n raise ValueError(\n 'sample_axis was None, which means all axis hold events, and this '\n 'overlaps with event_axis ({})'.format(event_axis))\n\n event_axis = _make_positive_axis(event_axis, tf.rank(x))\n sample_axis = _make_positive_axis(sample_axis, tf.rank(x))\n\n # If we get lucky and axis is statically defined, we can do some checks.\n if _is_list_like(event_axis) and _is_list_like(sample_axis):\n if set(event_axis).intersection(sample_axis):\n raise ValueError(\n 'sample_axis ({}) and event_axis ({}) overlapped'.format(\n sample_axis, event_axis))\n if (np.diff(sorted(event_axis)) > 1).any():\n raise ValueError(\n 'event_axis must be contiguous. Found: {}'.format(event_axis))\n batch_axis = list(\n sorted(\n set(range(x.shape.ndims)).difference(sample_axis + event_axis)))\n else:\n batch_axis, _ = tf.compat.v1.setdiff1d(\n tf.range(0, tf.rank(x)), tf.concat((sample_axis, event_axis), 0))\n\n event_axis = tf.convert_to_tensor(\n value=event_axis, name='event_axis', dtype=tf.int32)\n sample_axis = tf.convert_to_tensor(\n value=sample_axis, name='sample_axis', dtype=tf.int32)\n batch_axis = tf.convert_to_tensor(\n value=batch_axis, name='batch_axis', dtype=tf.int32)\n\n # Permute x/y until shape = B + E + S\n perm_for_xy = tf.concat((batch_axis, event_axis, sample_axis), 0)\n x_permed = tf.transpose(a=x, perm=perm_for_xy)\n y_permed = tf.transpose(a=y, perm=perm_for_xy)\n\n batch_ndims = tf.size(input=batch_axis)\n batch_shape = tf.shape(input=x_permed)[:batch_ndims]\n event_ndims = tf.size(input=event_axis)\n event_shape = tf.shape(input=x_permed)[batch_ndims:batch_ndims +\n event_ndims]\n sample_shape = tf.shape(input=x_permed)[batch_ndims + event_ndims:]\n sample_ndims = tf.size(input=sample_shape)\n n_samples = 
tf.reduce_prod(input_tensor=sample_shape)\n n_events = tf.reduce_prod(input_tensor=event_shape)\n\n # Flatten sample_axis into one long dim.\n x_permed_flat = tf.reshape(\n x_permed, tf.concat((batch_shape, event_shape, [n_samples]), 0))\n y_permed_flat = tf.reshape(\n y_permed, tf.concat((batch_shape, event_shape, [n_samples]), 0))\n # Do the same for event_axis.\n x_permed_flat = tf.reshape(\n x_permed, tf.concat((batch_shape, [n_events], [n_samples]), 0))\n y_permed_flat = tf.reshape(\n y_permed, tf.concat((batch_shape, [n_events], [n_samples]), 0))\n\n # After matmul, cov.shape = batch_shape + [n_events, n_events]\n cov = tf.matmul(\n x_permed_flat, y_permed_flat, adjoint_b=True) / tf.cast(\n n_samples, x.dtype)\n\n # Insert some singletons to make\n # cov.shape = batch_shape + event_shape**2 + [1,...,1]\n # This is just like x_permed.shape, except the sample_axis is all 1's, and\n # the [n_events] became event_shape**2.\n cov = tf.reshape(\n cov,\n tf.concat(\n (\n batch_shape,\n # event_shape**2 used here because it is the same length as\n # event_shape, and has the same number of elements as one\n # batch of covariance.\n event_shape**2,\n tf.ones([sample_ndims], tf.int32)),\n 0))\n # Permuting by the argsort inverts the permutation, making\n # cov.shape have ones in the position where there were samples, and\n # [n_events * n_events] in the event position.\n cov = tf.transpose(a=cov, perm=tf.math.invert_permutation(perm_for_xy))\n\n # Now expand event_shape**2 into event_shape + event_shape.\n # We here use (for the first time) the fact that we require event_axis to be\n # contiguous.\n e_start = event_axis[0]\n e_len = 1 + event_axis[-1] - event_axis[0]\n cov = tf.reshape(\n cov,\n tf.concat((tf.shape(input=cov)[:e_start], event_shape, event_shape,\n tf.shape(input=cov)[e_start + e_len:]), 0))\n\n # tf.squeeze requires python ints for axis, not Tensor. 
This is enough to\n # require our axis args to be constants.\n if not keepdims:\n squeeze_axis = tf.where(sample_axis < e_start, sample_axis,\n sample_axis + e_len)\n cov = _squeeze(cov, axis=squeeze_axis)\n\n return cov", "response": "Sample covariance between observations indexed by sample_axis and event_axis."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef correlation(x,\n y=None,\n sample_axis=0,\n event_axis=-1,\n keepdims=False,\n name=None):\n \"\"\"Sample correlation (Pearson) between observations indexed by `event_axis`.\n\n Given `N` samples of scalar random variables `X` and `Y`, correlation may be\n estimated as\n\n ```none\n Corr[X, Y] := Cov[X, Y] / Sqrt(Cov[X, X] * Cov[Y, Y]),\n where\n Cov[X, Y] := N^{-1} sum_{n=1}^N (X_n - Xbar) Conj{(Y_n - Ybar)}\n Xbar := N^{-1} sum_{n=1}^N X_n\n Ybar := N^{-1} sum_{n=1}^N Y_n\n ```\n\n Correlation is always in the interval `[-1, 1]`, and `Corr[X, X] == 1`.\n\n For vector-variate random variables `X = (X1, ..., Xd)`, `Y = (Y1, ..., Yd)`,\n one is often interested in the correlation matrix, `C_{ij} := Corr[Xi, Yj]`.\n\n ```python\n x = tf.random_normal(shape=(100, 2, 3))\n y = tf.random_normal(shape=(100, 2, 3))\n\n # corr[i, j] is the sample correlation between x[:, i, j] and y[:, i, j].\n corr = tfp.stats.correlation(x, y, sample_axis=0, event_axis=None)\n\n # corr_matrix[i, m, n] is the sample correlation of x[:, i, m] and y[:, i, n]\n corr_matrix = tfp.stats.correlation(x, y, sample_axis=0, event_axis=-1)\n ```\n\n Notice we divide by `N` (the numpy default), which does not create `NaN`\n when `N = 1`, but is slightly biased.\n\n Args:\n x: A numeric `Tensor` holding samples.\n y: Optional `Tensor` with same `dtype` and `shape` as `x`.\n Default value: `None` (`y` is effectively set to `x`).\n sample_axis: Scalar or vector `Tensor` designating axis holding samples, or\n `None` (meaning all axis hold samples).\n Default value: `0` 
(leftmost dimension).\n event_axis: Scalar or vector `Tensor`, or `None` (scalar events).\n Axis indexing random events, whose correlation we are interested in.\n If a vector, entries must form a contiguous block of dims. `sample_axis`\n and `event_axis` should not intersect.\n Default value: `-1` (rightmost axis holds events).\n keepdims: Boolean. Whether to keep the sample axis as singletons.\n name: Python `str` name prefixed to Ops created by this function.\n Default value: `None` (i.e., `'correlation'`).\n\n Returns:\n corr: A `Tensor` of same `dtype` as the `x`, and rank equal to\n `rank(x) - len(sample_axis) + 2 * len(event_axis)`.\n\n Raises:\n AssertionError: If `x` and `y` are found to have different shape.\n ValueError: If `sample_axis` and `event_axis` are found to overlap.\n ValueError: If `event_axis` is found to not be contiguous.\n \"\"\"\n\n with tf.compat.v1.name_scope(\n name, 'correlation', values=[x, y, event_axis, sample_axis]):\n # Corr[X, Y] = Cov[X, Y] / (Stddev[X] * Stddev[Y])\n # = Cov[X / Stddev[X], Y / Stddev[Y]]\n # So we could compute covariance first then divide by stddev, or\n # divide by stddev and compute covariance.\n # Dividing by stddev then computing covariance is potentially more stable.\n # But... computing covariance first then dividing involves 2 fewer large\n # broadcasts. 
We choose to divide first, largely because it avoids\n # difficulties with the various options for sample/event axis kwargs.\n\n x /= stddev(x, sample_axis=sample_axis, keepdims=True)\n if y is not None:\n y /= stddev(y, sample_axis=sample_axis, keepdims=True)\n\n return covariance(\n x=x,\n y=y,\n event_axis=event_axis,\n sample_axis=sample_axis,\n keepdims=keepdims)", "response": "Sample correlation between observations X and Y."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef stddev(x, sample_axis=0, keepdims=False, name=None):\n with tf.compat.v1.name_scope(name, 'stddev', values=[x, sample_axis]):\n return tf.sqrt(variance(x, sample_axis=sample_axis, keepdims=keepdims))", "response": "Estimate standard deviation of samples of scalar valued random variable X."} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef variance(x, sample_axis=0, keepdims=False, name=None):\n with tf.compat.v1.name_scope(name, 'variance', values=[x, sample_axis]):\n return covariance(\n x, y=None, sample_axis=sample_axis, event_axis=None, keepdims=keepdims)", "response": "Estimate variance using samples."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _make_list_or_1d_tensor(values):\n values = tf.convert_to_tensor(value=values, name='values')\n values_ = tf.get_static_value(values)\n\n # Static didn't work.\n if values_ is None:\n # Cheap way to bring to at least 1d.\n return values + tf.zeros([1], dtype=values.dtype)\n\n # Static worked!\n if values_.ndim > 1:\n raise ValueError('values had > 1 dim: {}'.format(values_.shape))\n # Cheap way to bring to at least 1d.\n values_ = values_ + np.zeros([1], dtype=values_.dtype)\n return list(values_)", "response": "Return a list or 1d Tensor from values."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function 
does\ndef _make_positive_axis(axis, ndims):\n axis = _make_list_or_1d_tensor(axis)\n\n ndims = tf.convert_to_tensor(value=ndims, name='ndims', dtype=tf.int32)\n ndims_ = tf.get_static_value(ndims)\n\n if _is_list_like(axis) and ndims_ is not None:\n # Static case\n positive_axis = []\n for a in axis:\n if a < 0:\n a = ndims_ + a\n positive_axis.append(a)\n else:\n # Dynamic case\n axis = tf.convert_to_tensor(value=axis, name='axis', dtype=tf.int32)\n positive_axis = tf.where(axis >= 0, axis, axis + ndims)\n\n return positive_axis", "response": "Rectify a possibly negative axis. Prefers to return a Python list."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _squeeze(x, axis):\n x = tf.convert_to_tensor(value=x, name='x')\n if axis is None:\n return tf.squeeze(x, axis=None)\n axis = tf.convert_to_tensor(value=axis, name='axis', dtype=tf.int32)\n axis += tf.zeros([1], dtype=axis.dtype) # Make axis at least 1d.\n keep_axis, _ = tf.compat.v1.setdiff1d(tf.range(0, tf.rank(x)), axis)\n return tf.reshape(x, tf.gather(tf.shape(input=x), keep_axis))", "response": "A version of squeeze that works with a dynamic axis."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _kl_normal_normal(n_a, n_b, name=None):\n with tf.name_scope(name or \"kl_normal_normal\"):\n one = tf.constant(1, dtype=n_a.dtype)\n two = tf.constant(2, dtype=n_a.dtype)\n half = tf.constant(0.5, dtype=n_a.dtype)\n s_a_squared = tf.square(n_a.scale)\n s_b_squared = tf.square(n_b.scale)\n ratio = s_a_squared / s_b_squared\n return (tf.square(n_a.loc - n_b.loc) / (two * s_b_squared) + half *\n (ratio - one - tf.math.log(ratio)))", "response": "Calculate the batched KL divergence between the Normal distributions n_a and n_b."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\nstandardize input x to a unit normal.", "response": "def _z(self, x):\n \"\"\"Standardize input `x` to a unit normal.\"\"\"\n with 
tf.name_scope(\"standardize\"):\n return (x - self.loc) / self.scale"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _inv_z(self, z):\n with tf.name_scope(\"reconstruct\"):\n return z * self.scale + self.loc", "response": "Reconstruct input x from its normalized version."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\nbuilding the transition matrix for a semi-local linear trend model.", "response": "def semilocal_linear_trend_transition_matrix(autoregressive_coef):\n \"\"\"Build the transition matrix for a semi-local linear trend model.\"\"\"\n # We want to write the following 2 x 2 matrix:\n # [[1., 1., ], # level(t+1) = level(t) + slope(t)\n # [0., ar_coef], # slope(t+1) = ar_coef * slope(t)\n # but it's slightly tricky to properly incorporate the batch shape of\n # autoregressive_coef. E.g., if autoregressive_coef has shape [4,6], we want\n # to return shape [4, 6, 2, 2]. We do this by breaking the matrix into its\n # fixed entries, written explicitly, and then the autoregressive_coef part\n # which we add in after using a mask to broadcast to the correct matrix shape.\n\n fixed_entries = tf.constant(\n [[1., 1.],\n [0., 0.]],\n dtype=autoregressive_coef.dtype)\n\n autoregressive_coef_mask = tf.constant([[0., 0.],\n [0., 1.]],\n dtype=autoregressive_coef.dtype)\n bottom_right_entry = (autoregressive_coef[..., tf.newaxis, tf.newaxis] *\n autoregressive_coef_mask)\n return tf.linalg.LinearOperatorFullMatrix(\n fixed_entries + bottom_right_entry)"} {"SOURCE": "codesearchnet", "instruction": "Can you create a Python 3 function that\nbuilds a transition noise model for a semi-local linear trend model.", "response": "def semilocal_linear_trend_transition_noise(level_scale,\n slope_mean,\n slope_scale,\n autoregressive_coef):\n \"\"\"Build the transition noise model for a semi-local linear trend model.\"\"\"\n\n # At each timestep, the stochasticity of `level` and `slope` are given\n # by 
`level_scale` and `slope_scale` respectively.\n broadcast_batch_shape = dist_util.get_broadcast_shape(\n level_scale, slope_mean, slope_scale, autoregressive_coef)\n broadcast_ones = tf.ones(broadcast_batch_shape, dtype=level_scale.dtype)\n scale_diag = tf.stack([level_scale * broadcast_ones,\n slope_scale * broadcast_ones],\n axis=-1)\n\n # We additionally fold in a bias term implementing the nonzero `slope_mean`.\n # The overall `slope` update is (from `SemiLocalLinearTrend` docstring)\n # slope[t] = (slope_mean +\n # autoregressive_coef * (slope[t-1] - slope_mean) +\n # Normal(0., slope_scale))\n # which we rewrite as\n # slope[t] = (\n # autoregressive_coef * slope[t-1] + # linear transition\n # Normal(loc=slope_mean - autoregressive_coef * slope_mean, # noise bias\n # scale=slope_scale)) # noise scale\n bias = tf.stack([tf.zeros_like(broadcast_ones),\n slope_mean * (1 - autoregressive_coef) * broadcast_ones],\n axis=-1)\n return tfd.MultivariateNormalDiag(\n loc=bias,\n scale_diag=scale_diag)"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef sample_halton_sequence(dim,\n num_results=None,\n sequence_indices=None,\n dtype=tf.float32,\n randomized=True,\n seed=None,\n name=None):\n r\"\"\"Returns a sample from the `dim` dimensional Halton sequence.\n\n Warning: The sequence elements take values only between 0 and 1. Care must be\n taken to appropriately transform the domain of a function if it differs from\n the unit cube before evaluating integrals using Halton samples. It is also\n important to remember that quasi-random numbers without randomization are not\n a replacement for pseudo-random numbers in every context. Quasi random numbers\n are completely deterministic and typically have significant negative\n autocorrelation unless randomization is used.\n\n Computes the members of the low discrepancy Halton sequence in dimension\n `dim`. 
The `dim`-dimensional sequence takes values in the unit hypercube in\n `dim` dimensions. Currently, only dimensions up to 1000 are supported. The\n prime base for the k-th axes is the k-th prime starting from 2. For example,\n if `dim` = 3, then the bases will be [2, 3, 5] respectively and the first\n element of the non-randomized sequence will be: [0.5, 0.333, 0.2]. For a more\n complete description of the Halton sequences see\n [here](https://en.wikipedia.org/wiki/Halton_sequence). For low discrepancy\n sequences and their applications see\n [here](https://en.wikipedia.org/wiki/Low-discrepancy_sequence).\n\n If `randomized` is true, this function produces a scrambled version of the\n Halton sequence introduced by [Owen (2017)][1]. For the advantages of\n randomization of low discrepancy sequences see [here](\n https://en.wikipedia.org/wiki/Quasi-Monte_Carlo_method#Randomization_of_quasi-Monte_Carlo).\n\n The number of samples produced is controlled by the `num_results` and\n `sequence_indices` parameters. The user must supply either `num_results` or\n `sequence_indices` but not both.\n The former is the number of samples to produce starting from the first\n element. If `sequence_indices` is given instead, the specified elements of\n the sequence are generated. 
For example, sequence_indices=tf.range(10) is\n equivalent to specifying n=10.\n\n #### Examples\n\n ```python\n import tensorflow as tf\n import tensorflow_probability as tfp\n\n # Produce the first 1000 members of the Halton sequence in 3 dimensions.\n num_results = 1000\n dim = 3\n sample = tfp.mcmc.sample_halton_sequence(\n dim,\n num_results=num_results,\n seed=127)\n\n # Evaluate the integral of x_1 * x_2^2 * x_3^3 over the three dimensional\n # hypercube.\n powers = tf.range(1.0, limit=dim + 1)\n integral = tf.reduce_mean(tf.reduce_prod(sample ** powers, axis=-1))\n true_value = 1.0 / tf.reduce_prod(powers + 1.0)\n with tf.Session() as session:\n values = session.run((integral, true_value))\n\n # Produces a relative absolute error of 1.7%.\n print (\"Estimated: %f, True Value: %f\" % values)\n\n # Now skip the first 1000 samples and recompute the integral with the next\n # thousand samples. The sequence_indices argument can be used to do this.\n\n\n sequence_indices = tf.range(start=1000, limit=1000 + num_results,\n dtype=tf.int32)\n sample_leaped = tfp.mcmc.sample_halton_sequence(\n dim,\n sequence_indices=sequence_indices,\n seed=111217)\n\n integral_leaped = tf.reduce_mean(tf.reduce_prod(sample_leaped ** powers,\n axis=-1))\n with tf.Session() as session:\n values = session.run((integral_leaped, true_value))\n # Now produces a relative absolute error of 0.05%.\n print (\"Leaped Estimated: %f, True Value: %f\" % values)\n ```\n\n Args:\n dim: Positive Python `int` representing each sample's `event_size.` Must\n not be greater than 1000.\n num_results: (Optional) Positive scalar `Tensor` of dtype int32. The number\n of samples to generate. Either this parameter or sequence_indices must\n be specified but not both. If this parameter is None, then the behaviour\n is determined by the `sequence_indices`.\n Default value: `None`.\n sequence_indices: (Optional) `Tensor` of dtype int32 and rank 1. 
The\n elements of the sequence to compute specified by their position in the\n sequence. The entries index into the Halton sequence starting with 0 and\n hence, must be whole numbers. For example, sequence_indices=[0, 5, 6] will\n produce the first, sixth and seventh elements of the sequence. If this\n parameter is None, then the `num_results` parameter must be specified\n which gives the number of desired samples starting from the first sample.\n Default value: `None`.\n dtype: (Optional) The dtype of the sample. One of: `float16`, `float32` or\n `float64`.\n Default value: `tf.float32`.\n randomized: (Optional) bool indicating whether to produce a randomized\n Halton sequence. If True, applies the randomization described in\n [Owen (2017)][1].\n Default value: `True`.\n seed: (Optional) Python integer to seed the random number generator. Only\n used if `randomized` is True. If not supplied and `randomized` is True,\n no seed is set.\n Default value: `None`.\n name: (Optional) Python `str` describing ops managed by this function. If\n not supplied the name of this function is used.\n Default value: \"sample_halton_sequence\".\n\n Returns:\n halton_elements: Elements of the Halton sequence. `Tensor` of supplied dtype\n and `shape` `[num_results, dim]` if `num_results` was specified or shape\n `[s, dim]` where s is the size of `sequence_indices` if `sequence_indices`\n were specified.\n\n Raises:\n ValueError: if both `sequence_indices` and `num_results` were specified or\n if dimension `dim` is less than 1 or greater than 1000.\n\n #### References\n\n [1]: Art B. Owen. A randomized Halton algorithm in R. _arXiv preprint\n arXiv:1706.02808_, 2017. https://arxiv.org/abs/1706.02808\n \"\"\"\n if dim < 1 or dim > _MAX_DIMENSION:\n raise ValueError(\n 'Dimension must be between 1 and {}. 
Supplied {}'.format(_MAX_DIMENSION,\n dim))\n if (num_results is None) == (sequence_indices is None):\n raise ValueError('Either `num_results` or `sequence_indices` must be'\n ' specified but not both.')\n\n if not dtype.is_floating:\n raise ValueError('dtype must be of `float`-type')\n\n with tf.compat.v1.name_scope(\n name, 'sample', values=[num_results, sequence_indices]):\n # Here and in the following, the shape layout is as follows:\n # [sample dimension, event dimension, coefficient dimension].\n # The coefficient dimension is an intermediate axes which will hold the\n # weights of the starting integer when expressed in the (prime) base for\n # an event dimension.\n if num_results is not None:\n num_results = tf.convert_to_tensor(value=num_results)\n if sequence_indices is not None:\n sequence_indices = tf.convert_to_tensor(value=sequence_indices)\n indices = _get_indices(num_results, sequence_indices, dtype)\n radixes = tf.constant(_PRIMES[0:dim], dtype=dtype, shape=[dim, 1])\n\n max_sizes_by_axes = _base_expansion_size(\n tf.reduce_max(input_tensor=indices), radixes)\n\n max_size = tf.reduce_max(input_tensor=max_sizes_by_axes)\n\n # The powers of the radixes that we will need. Note that there is a bit\n # of an excess here. Suppose we need the place value coefficients of 7\n # in base 2 and 3. For 2, we will have 3 digits but we only need 2 digits\n # for base 3. However, we can only create rectangular tensors so we\n # store both expansions in a [2, 3] tensor. This leads to the problem that\n # we might end up attempting to raise large numbers to large powers. For\n # example, base 2 expansion of 1024 has 10 digits. If we were in 10\n # dimensions, then the 10th prime (29) we will end up computing 29^10 even\n # though we don't need it. 
We avoid this by setting the exponents for each\n # axes to 0 beyond the maximum value needed for that dimension.\n exponents_by_axes = tf.tile([tf.range(max_size)], [dim, 1])\n\n # The mask is true for those coefficients that are irrelevant.\n weight_mask = exponents_by_axes >= max_sizes_by_axes\n capped_exponents = tf.where(\n weight_mask,\n tf.zeros_like(exponents_by_axes),\n exponents_by_axes)\n weights = radixes ** capped_exponents\n # The following computes the base b expansion of the indices. Suppose,\n # x = a0 + a1*b + a2*b^2 + ... Then, performing a floor div of x with\n # the vector (1, b, b^2, b^3, ...) will produce\n # (a0 + s1 * b, a1 + s2 * b, ...) where s_i are coefficients we don't care\n # about. Noting that all a_i < b by definition of place value expansion,\n # we see that taking the elements mod b of the above vector produces the\n # place value expansion coefficients.\n coeffs = tf.math.floordiv(indices, weights)\n coeffs *= 1. - tf.cast(weight_mask, dtype)\n coeffs %= radixes\n if not randomized:\n coeffs /= radixes\n return tf.reduce_sum(input_tensor=coeffs / weights, axis=-1)\n stream = distributions.SeedStream(seed, salt='MCMCSampleHaltonSequence')\n coeffs = _randomize(coeffs, radixes, seed=stream())\n # Remove the contribution from randomizing the trailing zero for the\n # axes where max_size_by_axes < max_size. This will be accounted\n # for separately below (using zero_correction).\n coeffs *= 1. - tf.cast(weight_mask, dtype)\n coeffs /= radixes\n base_values = tf.reduce_sum(input_tensor=coeffs / weights, axis=-1)\n\n # The randomization used in Owen (2017) does not leave 0 invariant. While\n # we have accounted for the randomization of the first `max_size_by_axes`\n # coefficients, we still need to correct for the trailing zeros. Luckily,\n # this is equivalent to adding a uniform random value scaled so the first\n # `max_size_by_axes` coefficients are zero. 
The following statements perform\n # this correction.\n zero_correction = tf.random.uniform([dim, 1], seed=stream(), dtype=dtype)\n zero_correction /= radixes ** max_sizes_by_axes\n return base_values + tf.reshape(zero_correction, [-1])", "response": "Returns a sample from the dim-dimensional Halton sequence."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _randomize(coeffs, radixes, seed=None):\n given_dtype = coeffs.dtype\n coeffs = tf.cast(coeffs, dtype=tf.int32)\n num_coeffs = tf.shape(input=coeffs)[-1]\n radixes = tf.reshape(tf.cast(radixes, dtype=tf.int32), shape=[-1])\n stream = distributions.SeedStream(seed, salt='MCMCSampleHaltonSequence2')\n perms = _get_permutations(num_coeffs, radixes, seed=stream())\n perms = tf.reshape(perms, shape=[-1])\n radix_sum = tf.reduce_sum(input_tensor=radixes)\n radix_offsets = tf.reshape(tf.cumsum(radixes, exclusive=True),\n shape=[-1, 1])\n offsets = radix_offsets + tf.range(num_coeffs) * radix_sum\n permuted_coeffs = tf.gather(perms, coeffs + offsets)\n return tf.cast(permuted_coeffs, dtype=given_dtype)", "response": "Applies the Owen (2017) randomization to the coefficients."} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\nreturns a tensor of shape [num_results, sum(dims)] with the same dtype as dims.", "response": "def _get_permutations(num_results, dims, seed=None):\n \"\"\"Uniform iid sample from the space of permutations.\n\n Draws a sample of size `num_results` from the group of permutations of degrees\n specified by the `dims` tensor. These are packed together into one tensor\n such that each row is one sample from each of the dimensions in `dims`. For\n example, if dims = [2,3] and num_results = 2, the result is a tensor of shape\n [2, 2 + 3] and the first row of the result might look like:\n [1, 0, 2, 0, 1]. 
The first two elements are a permutation over 2 elements\n while the next three are a permutation over 3 elements.\n\n Args:\n num_results: A positive scalar `Tensor` of integral type. The number of\n draws from the discrete uniform distribution over the permutation groups.\n dims: A 1D `Tensor` of the same dtype as `num_results`. The degree of the\n permutation groups from which to sample.\n seed: (Optional) Python integer to seed the random number generator.\n\n Returns:\n permutations: A `Tensor` of shape `[num_results, sum(dims)]` and the same\n dtype as `dims`.\n \"\"\"\n sample_range = tf.range(num_results)\n stream = distributions.SeedStream(seed, salt='MCMCSampleHaltonSequence3')\n def generate_one(d):\n seed = stream()\n fn = lambda _: tf.random.shuffle(tf.range(d), seed=seed)\n return tf.map_fn(\n fn,\n sample_range,\n parallel_iterations=1 if seed is not None else 10)\n return tf.concat([generate_one(d) for d in tf.unstack(dims)],\n axis=-1)"} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\ngenerating the indices for the Halton sequence procedure.", "response": "def _get_indices(num_results, sequence_indices, dtype, name=None):\n \"\"\"Generates starting points for the Halton sequence procedure.\n\n The k'th element of the sequence is generated starting from a positive integer\n which must be distinct for each `k`. It is conventional to choose the starting\n point as `k` itself (or `k+1` if k is zero based). This function generates\n the starting integers for the required elements and reshapes the result for\n later use.\n\n Args:\n num_results: Positive scalar `Tensor` of dtype int32. The number of samples\n to generate. If this parameter is supplied, then `sequence_indices`\n should be None.\n sequence_indices: `Tensor` of dtype int32 and rank 1. The entries\n index into the Halton sequence starting with 0 and hence, must be whole\n numbers. 
For example, sequence_indices=[0, 5, 6] will produce the first,\n sixth and seventh elements of the sequence. If this parameter is not None\n then `n` must be None.\n dtype: The dtype of the sample. One of `float32` or `float64`.\n Default is `float32`.\n name: Python `str` name which describes ops created by this function.\n\n Returns:\n indices: `Tensor` of dtype `dtype` and shape = `[n, 1, 1]`.\n \"\"\"\n with tf.compat.v1.name_scope(name, '_get_indices',\n [num_results, sequence_indices]):\n if sequence_indices is None:\n num_results = tf.cast(num_results, dtype=dtype)\n sequence_indices = tf.range(num_results, dtype=dtype)\n else:\n sequence_indices = tf.cast(sequence_indices, dtype)\n\n # Shift the indices so they are 1 based.\n indices = sequence_indices + 1\n\n # Reshape to make space for the event dimension and the place value\n # coefficients.\n return tf.reshape(indices, [-1, 1, 1])"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _base_expansion_size(num, bases):\n return tf.floor(tf.math.log(num) / tf.math.log(bases)) + 1", "response": "Computes the number of terms in the place value expansion of num in base bases."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _primes_less_than(n):\n # Based on\n # https://stackoverflow.com/questions/2068372/fastest-way-to-list-all-primes-below-n-in-python/3035188#3035188\n \"\"\"Returns sorted array of primes such that `2 <= prime < n`.\"\"\"\n small_primes = np.array((2, 3, 5))\n if n <= 6:\n return small_primes[small_primes < n]\n sieve = np.ones(n // 3 + (n % 6 == 2), dtype=np.bool)\n sieve[0] = False\n m = int(n ** 0.5) // 3 + 1\n for i in range(m):\n if not sieve[i]:\n continue\n k = 3 * i + 1 | 1\n sieve[k ** 2 // 3::2 * k] = False\n sieve[(k ** 2 + 4 * k - 2 * k * (i & 1)) // 3::2 * k] = False\n return np.r_[2, 3, 3 * np.nonzero(sieve)[0] + 1 | 1]", "response": "Returns sorted array 
of primes such that 2 <= prime < n."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function for\nreturning the machine epsilon for the supplied dtype.", "response": "def _machine_eps(dtype):\n \"\"\"Returns the machine epsilon for the supplied dtype.\"\"\"\n if isinstance(dtype, tf.DType):\n dtype = dtype.as_numpy_dtype()\n return np.finfo(dtype).eps"} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\nshrink the input step size until the value and grad become finite.", "response": "def _fix_step_size(value_and_gradients_function,\n val_c_input,\n active,\n step_size_shrink_param):\n \"\"\"Shrinks the input step size until the value and grad become finite.\"\"\"\n # The maximum iterations permitted are determined as the number of halvings\n # it takes to reduce 1 to 0 in the given dtype.\n iter_max = np.ceil(-np.log2(_machine_eps(val_c_input.x.dtype)))\n\n def _cond(i, val_c, to_fix):\n del val_c # Unused.\n return (i < iter_max) & tf.reduce_any(input_tensor=to_fix)\n\n def _body(i, val_c, to_fix):\n next_c = tf.where(to_fix, val_c.x * step_size_shrink_param, val_c.x)\n next_val_c = value_and_gradients_function(next_c)\n still_to_fix = to_fix & ~hzl.is_finite(next_val_c)\n return (i + 1, next_val_c, still_to_fix)\n\n to_fix = active & ~hzl.is_finite(val_c_input)\n return tf.while_loop(\n cond=_cond, body=_body, loop_vars=(0, val_c_input, to_fix))"} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\nbracket the minimum and performs a line search. Args: value_and_gradients_function: A Python callable that accepts a real scalar tensor and returns a namedtuple with the fields 'x', 'f', and 'df' that correspond to scalar tensors of real dtype containing the point at which the function was evaluated, the value of the function, and its derivative at that point. The other namedtuple fields, if present, should be tensors or sequences (possibly nested) of tensors. 
In usual optimization application, this function would be generated by projecting the multivariate objective function along some specific direction. The direction is determined by some other procedure but should be a descent direction (i.e. the derivative of the projected univariate function must be negative at 0.). Alternatively, the function may represent the batching of `n` such line functions (e.g. projecting a single multivariate objective function along `n` distinct directions at once) accepting n points as input, i.e. a tensor of shape [n], and the fields 'x', 'f' and 'df' in the returned namedtuple should each be a tensor of shape [n], with the corresponding input points, function values, and derivatives at those input points. init_interval: Instance of `HagerZhangLineSearchResults` containing the initial line search interval. The gradient of init_interval.left must be negative (i.e. must be a descent direction), while init_interval.right must be positive and finite. f_lim: Scalar `Tensor` of float dtype. max_iterations: Positive scalar `Tensor` of integral dtype. The maximum number of iterations to perform in the line search. The number of iterations used to bracket the minimum are also counted against this parameter. shrinkage_param: Scalar positive Tensor of real dtype. Must be less than `1.`. Corresponds to the parameter `gamma` in [Hager and Zhang (2006)][2]. expansion_param: Scalar positive `Tensor` of real dtype. Must be greater than `1.`. Used to expand the initial interval in case it does not bracket a minimum. Corresponds to `rho` in [Hager and Zhang (2006)][2]. sufficient_decrease_param: Positive scalar `Tensor` of real dtype. Bounded above by the curvature param. Corresponds to `delta` in the terminology of [Hager and Zhang (2006)][2]. curvature_param: Positive scalar `Tensor` of real dtype. Bounded above by `1.`. Corresponds to 'sigma' in the terminology of [Hager and Zhang (2006)][2]. Returns: A namedtuple containing the following fields. 
converged: Boolean `Tensor` of shape [n]. Whether a point satisfying Wolfe/Approx wolfe was found. failed: Boolean `Tensor` of shape [n]. Whether line search failed e.g. if either the objective function or the gradient are not finite at an evaluation point. iterations: Scalar int32 `Tensor`. Number of line search iterations made. func_evals: Scalar int32 `Tensor`. Number of function evaluations made. left: A namedtuple, as returned by value_and_gradients_function, of the left end point of the updated bracketing interval. right: A namedtuple, as returned by value_and_gradients_function, of the right end point of the updated bracketing interval.", "response": "def _bracket_and_search(\n value_and_gradients_function,\n init_interval,\n f_lim,\n max_iterations,\n shrinkage_param,\n expansion_param,\n sufficient_decrease_param,\n curvature_param):\n \"\"\"Brackets the minimum and performs a line search.\n\n Args:\n value_and_gradients_function: A Python callable that accepts a real scalar\n tensor and returns a namedtuple with the fields 'x', 'f', and 'df' that\n correspond to scalar tensors of real dtype containing the point at which\n the function was evaluated, the value of the function, and its\n derivative at that point. The other namedtuple fields, if present,\n should be tensors or sequences (possibly nested) of tensors.\n In usual optimization application, this function would be generated by\n projecting the multivariate objective function along some specific\n direction. The direction is determined by some other procedure but should\n be a descent direction (i.e. the derivative of the projected univariate\n function must be negative at 0.).\n Alternatively, the function may represent the batching of `n` such line\n functions (e.g. projecting a single multivariate objective function along\n `n` distinct directions at once) accepting n points as input, i.e. 
a\n tensor of shape [n], and the fields 'x', 'f' and 'df' in the returned\n namedtuple should each be a tensor of shape [n], with the corresponding\n input points, function values, and derivatives at those input points.\n init_interval: Instance of `HagerZhangLineSearchResults` containing\n the initial line search interval. The gradient of init_interval.left must\n be negative (i.e. must be a descent direction), while init_interval.right\n must be positive and finite.\n f_lim: Scalar `Tensor` of float dtype.\n max_iterations: Positive scalar `Tensor` of integral dtype. The maximum\n number of iterations to perform in the line search. The number of\n iterations used to bracket the minimum are also counted against this\n parameter.\n shrinkage_param: Scalar positive Tensor of real dtype. Must be less than\n `1.`. Corresponds to the parameter `gamma` in [Hager and Zhang (2006)][2].\n expansion_param: Scalar positive `Tensor` of real dtype. Must be greater\n than `1.`. Used to expand the initial interval in case it does not bracket\n a minimum. Corresponds to `rho` in [Hager and Zhang (2006)][2].\n sufficient_decrease_param: Positive scalar `Tensor` of real dtype.\n Bounded above by the curvature param. Corresponds to `delta` in the\n terminology of [Hager and Zhang (2006)][2].\n curvature_param: Positive scalar `Tensor` of real dtype. Bounded above\n by `1.`. Corresponds to 'sigma' in the terminology of\n [Hager and Zhang (2006)][2].\n\n Returns:\n A namedtuple containing the following fields.\n converged: Boolean `Tensor` of shape [n]. Whether a point satisfying\n Wolfe/Approx wolfe was found.\n failed: Boolean `Tensor` of shape [n]. Whether line search failed e.g.\n if either the objective function or the gradient are not finite at\n an evaluation point.\n iterations: Scalar int32 `Tensor`. Number of line search iterations made.\n func_evals: Scalar int32 `Tensor`. 
Number of function evaluations made.\n left: A namedtuple, as returned by value_and_gradients_function,\n of the left end point of the updated bracketing interval.\n right: A namedtuple, as returned by value_and_gradients_function,\n of the right end point of the updated bracketing interval.\n \"\"\"\n bracket_result = hzl.bracket(value_and_gradients_function, init_interval,\n f_lim, max_iterations, expansion_param)\n\n converged = init_interval.converged | _very_close(\n bracket_result.left.x, bracket_result.right.x)\n\n # We fail if we have not yet converged but already exhausted all iterations.\n exhausted_iterations = ~converged & tf.greater_equal(\n bracket_result.iteration, max_iterations)\n\n line_search_args = HagerZhangLineSearchResult(\n converged=converged,\n failed=bracket_result.failed | exhausted_iterations,\n iterations=bracket_result.iteration,\n func_evals=bracket_result.num_evals,\n left=bracket_result.left,\n right=bracket_result.right)\n\n return _line_search_after_bracketing(\n value_and_gradients_function, line_search_args, init_interval.left,\n f_lim, max_iterations, sufficient_decrease_param, curvature_param,\n shrinkage_param)"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _line_search_after_bracketing(\n value_and_gradients_function,\n search_interval,\n val_0,\n f_lim,\n max_iterations,\n sufficient_decrease_param,\n curvature_param,\n shrinkage_param):\n \"\"\"The main loop of line search after the minimum has been bracketed.\n\n Args:\n value_and_gradients_function: A Python callable that accepts a real scalar\n tensor and returns a namedtuple with the fields 'x', 'f', and 'df' that\n correspond to scalar tensors of real dtype containing the point at which\n the function was evaluated, the value of the function, and its\n derivative at that point. 
The other namedtuple fields, if present,\n should be tensors or sequences (possibly nested) of tensors.\n In usual optimization application, this function would be generated by\n projecting the multivariate objective function along some specific\n direction. The direction is determined by some other procedure but should\n be a descent direction (i.e. the derivative of the projected univariate\n function must be negative at 0.).\n Alternatively, the function may represent the batching of `n` such line\n functions (e.g. projecting a single multivariate objective function along\n `n` distinct directions at once) accepting n points as input, i.e. a\n tensor of shape [n], and the fields 'x', 'f' and 'df' in the returned\n namedtuple should each be a tensor of shape [n], with the corresponding\n input points, function values, and derivatives at those input points.\n search_interval: Instance of `HagerZhangLineSearchResults` containing\n the current line search interval.\n val_0: A namedtuple as returned by value_and_gradients_function evaluated\n at `0.`. The gradient must be negative (i.e. must be a descent direction).\n f_lim: Scalar `Tensor` of float dtype.\n max_iterations: Positive scalar `Tensor` of integral dtype. The maximum\n number of iterations to perform in the line search. The number of\n iterations used to bracket the minimum are also counted against this\n parameter.\n sufficient_decrease_param: Positive scalar `Tensor` of real dtype.\n Bounded above by the curvature param. Corresponds to `delta` in the\n terminology of [Hager and Zhang (2006)][2].\n curvature_param: Positive scalar `Tensor` of real dtype. Bounded above\n by `1.`. Corresponds to 'sigma' in the terminology of\n [Hager and Zhang (2006)][2].\n shrinkage_param: Scalar positive Tensor of real dtype. Must be less than\n `1.`. 
Corresponds to the parameter `gamma` in [Hager and Zhang (2006)][2].\n\n Returns:\n A namedtuple containing the following fields.\n converged: Boolean `Tensor` of shape [n]. Whether a point satisfying\n Wolfe/Approx wolfe was found.\n failed: Boolean `Tensor` of shape [n]. Whether line search failed e.g.\n if either the objective function or the gradient are not finite at\n an evaluation point.\n iterations: Scalar int32 `Tensor`. Number of line search iterations made.\n func_evals: Scalar int32 `Tensor`. Number of function evaluations made.\n left: A namedtuple, as returned by value_and_gradients_function,\n of the left end point of the updated bracketing interval.\n right: A namedtuple, as returned by value_and_gradients_function,\n of the right end point of the updated bracketing interval.\n \"\"\"\n\n def _loop_cond(curr_interval):\n \"\"\"Loop condition.\"\"\"\n active = ~(curr_interval.converged | curr_interval.failed)\n return (curr_interval.iterations <\n max_iterations) & tf.reduce_any(input_tensor=active)\n\n def _loop_body(curr_interval):\n \"\"\"The loop body.\"\"\"\n secant2_raw_result = hzl.secant2(\n value_and_gradients_function, val_0, curr_interval, f_lim,\n sufficient_decrease_param, curvature_param)\n secant2_result = HagerZhangLineSearchResult(\n converged=secant2_raw_result.converged,\n failed=secant2_raw_result.failed,\n iterations=curr_interval.iterations + 1,\n func_evals=secant2_raw_result.num_evals,\n left=secant2_raw_result.left,\n right=secant2_raw_result.right)\n\n should_check_shrinkage = ~(secant2_result.converged | secant2_result.failed)\n\n def _do_check_shrinkage():\n \"\"\"Check if interval has shrinked enough.\"\"\"\n old_width = curr_interval.right.x - curr_interval.left.x\n new_width = secant2_result.right.x - secant2_result.left.x\n sufficient_shrinkage = new_width < old_width * shrinkage_param\n func_is_flat = (\n _very_close(curr_interval.left.f, curr_interval.right.f) &\n _very_close(secant2_result.left.f, 
secant2_result.right.f))\n\n new_converged = (\n should_check_shrinkage & sufficient_shrinkage & func_is_flat)\n needs_inner_bisect = should_check_shrinkage & ~sufficient_shrinkage\n\n inner_bisect_args = secant2_result._replace(\n converged=secant2_result.converged | new_converged)\n\n def _apply_inner_bisect():\n return _line_search_inner_bisection(\n value_and_gradients_function, inner_bisect_args,\n needs_inner_bisect, f_lim)\n\n return prefer_static.cond(\n tf.reduce_any(input_tensor=needs_inner_bisect),\n _apply_inner_bisect,\n lambda: inner_bisect_args)\n\n next_args = prefer_static.cond(\n tf.reduce_any(input_tensor=should_check_shrinkage),\n _do_check_shrinkage,\n lambda: secant2_result)\n\n interval_shrunk = (\n ~next_args.failed & _very_close(next_args.left.x, next_args.right.x))\n return [next_args._replace(converged=next_args.converged | interval_shrunk)]\n\n return tf.while_loop(\n cond=_loop_cond,\n body=_loop_body,\n loop_vars=[search_interval],\n parallel_iterations=1)[0]", "response": "This is the main loop of the HagerZhangLineSearchResults function that is called after the minimum has been bracketed."} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef _line_search_inner_bisection(\n value_and_gradients_function,\n search_interval,\n active,\n f_lim):\n \"\"\"Performs bisection and updates the interval.\"\"\"\n midpoint = (search_interval.left.x + search_interval.right.x) / 2\n val_mid = value_and_gradients_function(midpoint)\n is_valid_mid = hzl.is_finite(val_mid)\n\n still_active = active & is_valid_mid\n new_failed = active & ~is_valid_mid\n next_inteval = search_interval._replace(\n failed=search_interval.failed | new_failed,\n func_evals=search_interval.func_evals + 1)\n\n def _apply_update():\n update_result = hzl.update(\n value_and_gradients_function, next_inteval.left, next_inteval.right,\n val_mid, f_lim, active=still_active)\n return HagerZhangLineSearchResult(\n 
converged=next_inteval.converged,\n failed=next_inteval.failed | update_result.failed,\n iterations=next_inteval.iterations + update_result.iteration,\n func_evals=next_inteval.func_evals + update_result.num_evals,\n left=update_result.left,\n right=update_result.right)\n\n return prefer_static.cond(\n tf.reduce_any(input_tensor=still_active),\n _apply_update,\n lambda: next_inteval)", "response": "Performs a bisection and updates the interval."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function for\npreparing the arguments for the line search.", "response": "def _prepare_args(value_and_gradients_function,\n initial_step_size,\n val_initial,\n val_0,\n approximate_wolfe_threshold):\n \"\"\"Prepares the arguments for the line search initialization.\n\n Args:\n value_and_gradients_function: A Python callable that accepts a real scalar\n tensor and returns a namedtuple with the fields 'x', 'f', and 'df' that\n correspond to scalar tensors of real dtype containing the point at which\n the function was evaluated, the value of the function, and its\n derivative at that point. The other namedtuple fields, if present,\n should be tensors or sequences (possibly nested) of tensors.\n In usual optimization application, this function would be generated by\n projecting the multivariate objective function along some specific\n direction. The direction is determined by some other procedure but should\n be a descent direction (i.e. the derivative of the projected univariate\n function must be negative at 0.).\n Alternatively, the function may represent the batching of `n` such line\n functions (e.g. projecting a single multivariate objective function along\n `n` distinct directions at once) accepting n points as input, i.e. 
a\n tensor of shape [n], and the fields 'x', 'f' and 'df' in the returned\n namedtuple should each be a tensor of shape [n], with the corresponding\n input points, function values, and derivatives at those input points.\n initial_step_size: Scalar positive `Tensor` of real dtype, or a tensor of\n shape [n] in batching mode. The initial value (or values) to try to\n bracket the minimum. Default is `1.` as a float32.\n Note that this point need not necessarily bracket the minimum for the line\n search to work correctly but the supplied value must be greater than 0.\n A good initial value will make the search converge faster.\n val_initial: The full return value of evaluating\n value_and_gradients_function at initial_step_size, i.e. a namedtuple with\n 'x', 'f', 'df', if already known by the caller. If not None the value of\n `initial_step_size` will be ignored, otherwise the tuple will be computed\n by evaluating value_and_gradients_function.\n val_0: The full return value of value_and_gradients_function at `0.`, i.e.\n a namedtuple with 'x', 'f', 'df', if already known by the caller. If None\n the tuple will be computed by evaluating value_and_gradients_function.\n approximate_wolfe_threshold: Scalar positive `Tensor` of\n real dtype. Corresponds to the parameter 'epsilon' in\n [Hager and Zhang (2006)][2]. Used to estimate the\n threshold at which the line search switches to approximate Wolfe\n conditions.\n\n Returns:\n left: A namedtuple, as returned by value_and_gradients_function,\n containing the value and derivative of the function at `0.`.\n val_initial: A namedtuple, as returned by value_and_gradients_function,\n containing the value and derivative of the function at\n `initial_step_size`.\n f_lim: Real `Tensor` of shape [n]. The function value threshold for\n the approximate Wolfe conditions to be checked.\n eval_count: Scalar int32 `Tensor`. 
The number of target function\n evaluations made by this function.\n \"\"\"\n eval_count = 0\n if val_initial is None:\n if initial_step_size is not None:\n initial_step_size = tf.convert_to_tensor(value=initial_step_size)\n else:\n initial_step_size = tf.convert_to_tensor(value=1.0, dtype=tf.float32)\n val_initial = value_and_gradients_function(initial_step_size)\n eval_count += 1\n\n if val_0 is None:\n x_0 = tf.zeros_like(val_initial.x)\n val_0 = value_and_gradients_function(x_0)\n eval_count += 1\n\n f_lim = val_0.f + (approximate_wolfe_threshold * tf.abs(val_0.f))\n return val_0, val_initial, f_lim, tf.convert_to_tensor(value=eval_count)"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _to_str(x):\n x = tf.convert_to_tensor(value=x)\n if x.dtype == tf.bool:\n return tf.where(x, tf.fill(x.shape, 'True'), tf.fill(x.shape, 'False'))\n return x", "response": "Converts a bool tensor to a string with True / False values."} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\nwraps for tf. 
Print which supports lists and namedtuples for printing.", "response": "def _print(pass_through_tensor, values):\n \"\"\"Wrapper for tf.Print which supports lists and namedtuples for printing.\"\"\"\n flat_values = []\n for value in values:\n # Checks if it is a namedtuple.\n if hasattr(value, '_fields'):\n for field in value._fields:\n flat_values.extend([field, _to_str(getattr(value, field))])\n continue\n if isinstance(value, (list, tuple)):\n for v in value:\n flat_values.append(_to_str(v))\n continue\n flat_values.append(_to_str(value))\n return tf.compat.v1.Print(pass_through_tensor, flat_values)"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _kl_brute_force(a, b, name=None):\n\n def squared_frobenius_norm(x):\n \"\"\"Helper to make KL calculation slightly more readable.\"\"\"\n # http://mathworld.wolfram.com/FrobeniusNorm.html\n # The gradient of KL[p,q] is not defined when p==q. The culprit is\n # tf.norm, i.e., we cannot use the commented out code.\n # return tf.square(tf.norm(x, ord=\"fro\", axis=[-2, -1]))\n return tf.reduce_sum(input_tensor=tf.square(x), axis=[-2, -1])\n\n # TODO(b/35041439): See also b/35040945. 
Remove this function once LinOp\n # supports something like:\n # A.inverse().solve(B).norm(order='fro', axis=[-1, -2])\n def is_diagonal(x):\n \"\"\"Helper to identify if `LinearOperator` has only a diagonal component.\"\"\"\n return (isinstance(x, tf.linalg.LinearOperatorIdentity) or\n isinstance(x, tf.linalg.LinearOperatorScaledIdentity) or\n isinstance(x, tf.linalg.LinearOperatorDiag))\n\n with tf.name_scope(name or \"kl_mvn\"):\n # Calculation is based on:\n # http://stats.stackexchange.com/questions/60680/kl-divergence-between-two-multivariate-gaussians\n # and,\n # https://en.wikipedia.org/wiki/Matrix_norm#Frobenius_norm\n # i.e.,\n # If Ca = AA', Cb = BB', then\n # tr[inv(Cb) Ca] = tr[inv(B)' inv(B) A A']\n # = tr[inv(B) A A' inv(B)']\n # = tr[(inv(B) A) (inv(B) A)']\n # = sum_{ij} (inv(B) A)_{ij}**2\n # = ||inv(B) A||_F**2\n # where ||.||_F is the Frobenius norm and the second equality follows from\n # the cyclic permutation property.\n if is_diagonal(a.scale) and is_diagonal(b.scale):\n # Using `stddev` because it handles expansion of Identity cases.\n b_inv_a = (a.stddev() / b.stddev())[..., tf.newaxis]\n else:\n b_inv_a = b.scale.solve(a.scale.to_dense())\n kl_div = (\n b.scale.log_abs_determinant() - a.scale.log_abs_determinant() +\n 0.5 * (-tf.cast(a.scale.domain_dimension_tensor(), a.dtype) +\n squared_frobenius_norm(b_inv_a) + squared_frobenius_norm(\n b.scale.solve((b.mean() - a.mean())[..., tf.newaxis]))))\n tensorshape_util.set_shape(\n kl_div, tf.broadcast_static_shape(a.batch_shape, b.batch_shape))\n return kl_div", "response": "Batchwise KL divergence for a batch of multivariate normals."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef quadrature_scheme_softmaxnormal_gauss_hermite(\n normal_loc, normal_scale, quadrature_size,\n validate_args=False, name=None):\n \"\"\"Use Gauss-Hermite quadrature to form quadrature on `K - 1` simplex.\n\n A `SoftmaxNormal` random variable `Y` may be 
generated via\n\n ```\n Y = SoftmaxCentered(X),\n X = Normal(normal_loc, normal_scale)\n ```\n\n Note: for a given `quadrature_size`, this method is generally less accurate\n than `quadrature_scheme_softmaxnormal_quantiles`.\n\n Args:\n normal_loc: `float`-like `Tensor` with shape `[b1, ..., bB, K-1]`, B>=0.\n The location parameter of the Normal used to construct the SoftmaxNormal.\n normal_scale: `float`-like `Tensor`. Broadcastable with `normal_loc`.\n The scale parameter of the Normal used to construct the SoftmaxNormal.\n quadrature_size: Python `int` scalar representing the number of quadrature\n points.\n validate_args: Python `bool`, default `False`. When `True` distribution\n parameters are checked for validity despite possibly degrading runtime\n performance. When `False` invalid inputs may silently render incorrect\n outputs.\n name: Python `str` name prefixed to Ops created by this class.\n\n Returns:\n grid: Shape `[b1, ..., bB, K, quadrature_size]` `Tensor` representing the\n convex combination of affine parameters for `K` components.\n `grid[..., :, n]` is the `n`-th grid point, living in the `K - 1` simplex.\n probs: Shape `[b1, ..., bB, K, quadrature_size]` `Tensor` representing the\n associated with each grid point.\n \"\"\"\n with tf.name_scope(\n name or \"quadrature_scheme_softmaxnormal_gauss_hermite\"):\n normal_loc = tf.convert_to_tensor(value=normal_loc, name=\"normal_loc\")\n npdt = dtype_util.as_numpy_dtype(normal_loc.dtype)\n normal_scale = tf.convert_to_tensor(\n value=normal_scale, dtype=npdt, name=\"normal_scale\")\n\n normal_scale = maybe_check_quadrature_param(\n normal_scale, \"normal_scale\", validate_args)\n\n grid, probs = np.polynomial.hermite.hermgauss(deg=quadrature_size)\n grid = grid.astype(npdt)\n probs = probs.astype(npdt)\n probs /= np.linalg.norm(probs, ord=1, keepdims=True)\n probs = tf.convert_to_tensor(value=probs, name=\"probs\", dtype=npdt)\n\n grid = softmax(\n -distribution_util.pad(\n (normal_loc[..., tf.newaxis] 
+\n np.sqrt(2.) * normal_scale[..., tf.newaxis] * grid),\n axis=-2,\n front=True),\n axis=-2) # shape: [B, components, deg]\n\n return grid, probs", "response": "Use Gauss-Hermite quadrature to form a quadrature on the K - 1 simplex."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef quadrature_scheme_softmaxnormal_quantiles(\n normal_loc, normal_scale, quadrature_size,\n validate_args=False, name=None):\n \"\"\"Use SoftmaxNormal quantiles to form quadrature on `K - 1` simplex.\n\n A `SoftmaxNormal` random variable `Y` may be generated via\n\n ```\n Y = SoftmaxCentered(X),\n X = Normal(normal_loc, normal_scale)\n ```\n\n Args:\n normal_loc: `float`-like `Tensor` with shape `[b1, ..., bB, K-1]`, B>=0.\n The location parameter of the Normal used to construct the SoftmaxNormal.\n normal_scale: `float`-like `Tensor`. Broadcastable with `normal_loc`.\n The scale parameter of the Normal used to construct the SoftmaxNormal.\n quadrature_size: Python `int` scalar representing the number of quadrature\n points.\n validate_args: Python `bool`, default `False`. When `True` distribution\n parameters are checked for validity despite possibly degrading runtime\n performance. 
When `False` invalid inputs may silently render incorrect\n outputs.\n name: Python `str` name prefixed to Ops created by this class.\n\n Returns:\n grid: Shape `[b1, ..., bB, K, quadrature_size]` `Tensor` representing the\n convex combination of affine parameters for `K` components.\n `grid[..., :, n]` is the `n`-th grid point, living in the `K - 1` simplex.\n probs: Shape `[b1, ..., bB, K, quadrature_size]` `Tensor` representing the\n associated with each grid point.\n \"\"\"\n with tf.name_scope(name or \"softmax_normal_grid_and_probs\"):\n normal_loc = tf.convert_to_tensor(value=normal_loc, name=\"normal_loc\")\n dt = dtype_util.base_dtype(normal_loc.dtype)\n normal_scale = tf.convert_to_tensor(\n value=normal_scale, dtype=dt, name=\"normal_scale\")\n\n normal_scale = maybe_check_quadrature_param(\n normal_scale, \"normal_scale\", validate_args)\n\n dist = normal.Normal(loc=normal_loc, scale=normal_scale)\n\n def _get_batch_ndims():\n \"\"\"Helper to get rank(dist.batch_shape), statically if possible.\"\"\"\n ndims = tensorshape_util.rank(dist.batch_shape)\n if ndims is None:\n ndims = tf.shape(input=dist.batch_shape_tensor())[0]\n return ndims\n batch_ndims = _get_batch_ndims()\n\n def _get_final_shape(qs):\n \"\"\"Helper to build `TensorShape`.\"\"\"\n bs = tensorshape_util.with_rank_at_least(dist.batch_shape, 1)\n num_components = tf.compat.dimension_value(bs[-1])\n if num_components is not None:\n num_components += 1\n tail = tf.TensorShape([num_components, qs])\n return bs[:-1].concatenate(tail)\n\n def _compute_quantiles():\n \"\"\"Helper to build quantiles.\"\"\"\n # Omit {0, 1} since they might lead to Inf/NaN.\n zero = tf.zeros([], dtype=dist.dtype)\n edges = tf.linspace(zero, 1., quadrature_size + 3)[1:-1]\n # Expand edges so its broadcast across batch dims.\n edges = tf.reshape(\n edges,\n shape=tf.concat(\n [[-1], tf.ones([batch_ndims], dtype=tf.int32)], axis=0))\n quantiles = dist.quantile(edges)\n quantiles = 
softmax_centered_bijector.SoftmaxCentered().forward(quantiles)\n # Cyclically permute left by one.\n perm = tf.concat([tf.range(1, 1 + batch_ndims), [0]], axis=0)\n quantiles = tf.transpose(a=quantiles, perm=perm)\n tensorshape_util.set_shape(\n quantiles, _get_final_shape(quadrature_size + 1))\n return quantiles\n quantiles = _compute_quantiles()\n\n # Compute grid as quantile midpoints.\n grid = (quantiles[..., :-1] + quantiles[..., 1:]) / 2.\n # Set shape hints.\n tensorshape_util.set_shape(grid, _get_final_shape(quadrature_size))\n\n # By construction probs is constant, i.e., `1 / quadrature_size`. This is\n # important, because non-constant probs leads to non-reparameterizable\n # samples.\n probs = tf.fill(\n dims=[quadrature_size], value=1. / tf.cast(quadrature_size, dist.dtype))\n\n return grid, probs", "response": "Use SoftmaxNormal quantiles to form a quadrature on the K - 1 simplex."} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef maybe_check_quadrature_param(param, name, validate_args):\n with tf.name_scope(\"check_\" + name):\n assertions = []\n if tensorshape_util.rank(param.shape) is not None:\n if tensorshape_util.rank(param.shape) == 0:\n raise ValueError(\"Mixing params must be a (batch of) vector; \"\n \"{}.rank={} is not at least one.\".format(\n name, tensorshape_util.rank(param.shape)))\n elif validate_args:\n assertions.append(\n assert_util.assert_rank_at_least(\n param,\n 1,\n message=(\"Mixing params must be a (batch of) vector; \"\n \"{}.rank is not at least one.\".format(name))))\n\n # TODO(jvdillon): Remove once we support k-mixtures.\n if tensorshape_util.with_rank_at_least(param.shape, 1)[-1] is not None:\n if tf.compat.dimension_value(param.shape[-1]) != 1:\n raise NotImplementedError(\"Currently only bimixtures are supported; \"\n \"{}.shape[-1]={} is not 1.\".format(\n name,\n tf.compat.dimension_value(\n param.shape[-1])))\n elif validate_args:\n assertions.append(\n 
assert_util.assert_equal(\n tf.shape(input=param)[-1],\n 1,\n message=(\"Currently only bimixtures are supported; \"\n \"{}.shape[-1] is not 1.\".format(name))))\n\n if assertions:\n return distribution_util.with_dependencies(assertions, param)\n return param", "response": "Helper which checks validity of loc and scale init args."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef determine_batch_event_shapes(grid, endpoint_affine):\n with tf.name_scope(\"determine_batch_event_shapes\"):\n # grid # shape: [B, k, q]\n # endpoint_affine # len=k, shape: [B, d, d]\n batch_shape = grid.shape[:-2]\n batch_shape_tensor = tf.shape(input=grid)[:-2]\n event_shape = None\n event_shape_tensor = None\n\n def _set_event_shape(shape, shape_tensor):\n if event_shape is None:\n return shape, shape_tensor\n return (tf.broadcast_static_shape(event_shape, shape),\n tf.broadcast_dynamic_shape(event_shape_tensor, shape_tensor))\n\n for aff in endpoint_affine:\n if aff.shift is not None:\n batch_shape = tf.broadcast_static_shape(batch_shape,\n aff.shift.shape[:-1])\n batch_shape_tensor = tf.broadcast_dynamic_shape(\n batch_shape_tensor,\n tf.shape(input=aff.shift)[:-1])\n event_shape, event_shape_tensor = _set_event_shape(\n aff.shift.shape[-1:],\n tf.shape(input=aff.shift)[-1:])\n\n if aff.scale is not None:\n batch_shape = tf.broadcast_static_shape(batch_shape,\n aff.scale.batch_shape)\n batch_shape_tensor = tf.broadcast_dynamic_shape(\n batch_shape_tensor, aff.scale.batch_shape_tensor())\n event_shape, event_shape_tensor = _set_event_shape(\n tf.TensorShape([aff.scale.range_dimension]),\n aff.scale.range_dimension_tensor()[tf.newaxis])\n\n return batch_shape, batch_shape_tensor, event_shape, event_shape_tensor", "response": "Helper to infer batch_shape and event_shape."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef interpolate_loc(grid, loc):\n if 
len(loc) != 2:\n raise NotImplementedError(\"Currently only bimixtures are supported; \"\n \"len(scale)={} is not 2.\".format(len(loc)))\n deg = tf.compat.dimension_value(\n tensorshape_util.with_rank_at_least(grid.shape, 1)[-1])\n if deg is None:\n raise ValueError(\"Num quadrature grid points must be known prior \"\n \"to graph execution.\")\n with tf.name_scope(\"interpolate_loc\"):\n if loc is None or loc[0] is None and loc[1] is None:\n return [None]*deg\n # shape: [B, 1, k, deg]\n w = grid[..., tf.newaxis, :, :]\n loc = [\n x[..., tf.newaxis] # shape: [B, e, 1]\n if x is not None else None for x in loc\n ]\n if loc[0] is None:\n x = w[..., 1, :] * loc[1] # shape: [B, e, deg]\n elif loc[1] is None:\n x = w[..., 0, :] * loc[0] # shape: [B, e, deg]\n else:\n delta = loc[0] - loc[1]\n x = w[..., 0, :] * delta + loc[1] # shape: [B, e, deg]\n return [x[..., k] for k in range(deg)]", "response": "Helper which interpolates between two locs."} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\ncreates weighted LinearOp from an existing LinOp.", "response": "def linop_scale(w, op):\n \"\"\"Creates weighted `LinOp` from existing `LinOp`.\"\"\"\n # We assume w > 0. (This assumption only relates to the is_* attributes.)\n with tf.name_scope(\"linop_scale\"):\n # TODO(b/35301104): LinearOperatorComposition doesn't combine operators, so\n # special case combinations here. 
Once it does, this function can be\n # replaced by:\n # return linop_composition_lib.LinearOperatorComposition([\n # scaled_identity(w), op])\n def scaled_identity(w):\n return tf.linalg.LinearOperatorScaledIdentity(\n num_rows=op.range_dimension_tensor(),\n multiplier=w,\n is_non_singular=op.is_non_singular,\n is_self_adjoint=op.is_self_adjoint,\n is_positive_definite=op.is_positive_definite)\n\n if isinstance(op, tf.linalg.LinearOperatorIdentity):\n return scaled_identity(w)\n if isinstance(op, tf.linalg.LinearOperatorScaledIdentity):\n return scaled_identity(w * op.multiplier)\n if isinstance(op, tf.linalg.LinearOperatorDiag):\n return tf.linalg.LinearOperatorDiag(\n diag=w[..., tf.newaxis] * op.diag_part(),\n is_non_singular=op.is_non_singular,\n is_self_adjoint=op.is_self_adjoint,\n is_positive_definite=op.is_positive_definite)\n if isinstance(op, tf.linalg.LinearOperatorLowerTriangular):\n return tf.linalg.LinearOperatorLowerTriangular(\n tril=w[..., tf.newaxis, tf.newaxis] * op.to_dense(),\n is_non_singular=op.is_non_singular,\n is_self_adjoint=op.is_self_adjoint,\n is_positive_definite=op.is_positive_definite)\n raise NotImplementedError(\n \"Unsupported Linop type ({})\".format(type(op).__name__))"} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\nconcatenate input vectors statically if possible.", "response": "def concat_vectors(*args):\n \"\"\"Concatenates input vectors, statically if possible.\"\"\"\n args_ = [tf.get_static_value(x) for x in args]\n if any(vec is None for vec in args_):\n return tf.concat(args, axis=0)\n return [val for vec in args_ for val in vec]"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef softmax(x, axis, name=None):\n with tf.name_scope(name or \"softmax\"):\n x = tf.convert_to_tensor(value=x, name=\"x\")\n ndims = (\n tensorshape_util.rank(x.shape)\n if tensorshape_util.rank(x.shape) is not None else tf.rank(\n x, 
name=\"ndims\"))\n axis = tf.convert_to_tensor(value=axis, dtype=tf.int32, name=\"axis\")\n axis_ = tf.get_static_value(axis)\n if axis_ is not None:\n axis = np.int(ndims + axis_ if axis_ < 0 else axis_)\n else:\n axis = tf.where(axis < 0, ndims + axis, axis)\n return tf.nn.softmax(x, axis=axis)", "response": "Equivalent to tf.nn.softmax but works around b/70297725."} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\nensures self.distribution.mean() has [batch, event] shape.", "response": "def _expand_base_distribution_mean(self):\n \"\"\"Ensures `self.distribution.mean()` has `[batch, event]` shape.\"\"\"\n single_draw_shape = concat_vectors(self.batch_shape_tensor(),\n self.event_shape_tensor())\n m = tf.reshape(\n self.distribution.mean(), # A scalar.\n shape=tf.ones_like(single_draw_shape, dtype=tf.int32))\n m = tf.tile(m, multiples=single_draw_shape)\n tensorshape_util.set_shape(\n m, tensorshape_util.concatenate(self.batch_shape, self.event_shape))\n return m"} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\nmultiplies tensor of vectors by matrices assuming values stored are logs.", "response": "def _log_vector_matrix(vs, ms):\n \"\"\"Multiply tensor of vectors by matrices assuming values stored are logs.\"\"\"\n\n return tf.reduce_logsumexp(input_tensor=vs[..., tf.newaxis] + ms, axis=-2)"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _log_matrix_vector(ms, vs):\n\n return tf.reduce_logsumexp(input_tensor=ms + vs[..., tf.newaxis, :], axis=-1)", "response": "Multiply tensor of matrices by vectors assuming values stored are logs."} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\nmultiply tensor of vectors by matrices.", "response": "def _vector_matrix(vs, ms):\n \"\"\"Multiply tensor of vectors by matrices.\"\"\"\n\n return tf.reduce_sum(input_tensor=vs[..., tf.newaxis] * ms, axis=-2)"} 
{"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _extract_log_probs(num_states, dist):\n\n states = tf.reshape(tf.range(num_states),\n tf.concat([[num_states],\n tf.ones_like(dist.batch_shape_tensor())],\n axis=0))\n return distribution_util.move_dimension(dist.log_prob(states), 0, -1)", "response": "Tabulate log probabilities from a batch of distributions."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\ncompute marginal pdf for each individual observable.", "response": "def _marginal_hidden_probs(self):\n \"\"\"Compute marginal pdf for each individual observable.\"\"\"\n\n initial_log_probs = tf.broadcast_to(self._log_init,\n tf.concat([self.batch_shape_tensor(),\n [self._num_states]],\n axis=0))\n # initial_log_probs :: batch_shape num_states\n\n if self._num_steps > 1:\n transition_log_probs = self._log_trans\n\n def forward_step(log_probs, _):\n return _log_vector_matrix(log_probs, transition_log_probs)\n\n dummy_index = tf.zeros(self._num_steps - 1, dtype=tf.float32)\n\n forward_log_probs = tf.scan(forward_step, dummy_index,\n initializer=initial_log_probs,\n name=\"forward_log_probs\")\n\n forward_log_probs = tf.concat([[initial_log_probs], forward_log_probs],\n axis=0)\n else:\n forward_log_probs = initial_log_probs[tf.newaxis, ...]\n\n # returns :: num_steps batch_shape num_states\n\n return tf.exp(forward_log_probs)"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef posterior_marginals(self, observations, name=None):\n\n with tf.name_scope(name or \"posterior_marginals\"):\n with tf.control_dependencies(self._runtime_assertions):\n observation_tensor_shape = tf.shape(input=observations)\n\n with self._observation_shape_preconditions(observation_tensor_shape):\n observation_batch_shape = observation_tensor_shape[\n :-1 - self._underlying_event_rank]\n observation_event_shape = observation_tensor_shape[\n -1 - 
self._underlying_event_rank:]\n\n batch_shape = tf.broadcast_dynamic_shape(observation_batch_shape,\n self.batch_shape_tensor())\n log_init = tf.broadcast_to(self._log_init,\n tf.concat([batch_shape,\n [self._num_states]],\n axis=0))\n log_transition = self._log_trans\n\n observations = tf.broadcast_to(observations,\n tf.concat([batch_shape,\n observation_event_shape],\n axis=0))\n observation_rank = tf.rank(observations)\n underlying_event_rank = self._underlying_event_rank\n observations = distribution_util.move_dimension(\n observations, observation_rank - underlying_event_rank - 1, 0)\n observations = tf.expand_dims(\n observations,\n observation_rank - underlying_event_rank)\n observation_log_probs = self._observation_distribution.log_prob(\n observations)\n\n log_adjoint_prob = tf.zeros_like(log_init)\n\n def forward_step(log_previous_step, log_prob_observation):\n return _log_vector_matrix(log_previous_step,\n log_transition) + log_prob_observation\n\n log_prob = log_init + observation_log_probs[0]\n\n forward_log_probs = tf.scan(forward_step, observation_log_probs[1:],\n initializer=log_prob,\n name=\"forward_log_probs\")\n\n forward_log_probs = tf.concat([[log_prob], forward_log_probs], axis=0)\n\n def backward_step(log_previous_step, log_prob_observation):\n return _log_matrix_vector(log_transition,\n log_prob_observation + log_previous_step)\n\n backward_log_adjoint_probs = tf.scan(\n backward_step,\n observation_log_probs[1:],\n initializer=log_adjoint_prob,\n reverse=True,\n name=\"backward_log_adjoint_probs\")\n\n total_log_prob = tf.reduce_logsumexp(\n input_tensor=forward_log_probs[-1], axis=-1)\n\n backward_log_adjoint_probs = tf.concat([backward_log_adjoint_probs,\n [log_adjoint_prob]], axis=0)\n\n log_likelihoods = forward_log_probs + backward_log_adjoint_probs\n\n marginal_log_probs = distribution_util.move_dimension(\n log_likelihoods - total_log_prob[..., tf.newaxis], 0, -2)\n\n return categorical.Categorical(logits=marginal_log_probs)", 
"response": "Compute the marginal posterior probability for each state."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef posterior_mode(self, observations, name=None):\n\n with tf.name_scope(name or \"posterior_mode\"):\n with tf.control_dependencies(self._runtime_assertions):\n observation_tensor_shape = tf.shape(input=observations)\n\n with self._observation_shape_preconditions(observation_tensor_shape):\n observation_batch_shape = observation_tensor_shape[\n :-1 - self._underlying_event_rank]\n observation_event_shape = observation_tensor_shape[\n -1 - self._underlying_event_rank:]\n\n batch_shape = tf.broadcast_dynamic_shape(observation_batch_shape,\n self.batch_shape_tensor())\n log_init = tf.broadcast_to(self._log_init,\n tf.concat([batch_shape,\n [self._num_states]],\n axis=0))\n\n observations = tf.broadcast_to(observations,\n tf.concat([batch_shape,\n observation_event_shape],\n axis=0))\n observation_rank = tf.rank(observations)\n underlying_event_rank = self._underlying_event_rank\n observations = distribution_util.move_dimension(\n observations, observation_rank - underlying_event_rank - 1, 0)\n\n # We need to compute the probability of each observation for\n # each possible state.\n # This requires inserting an extra index just before the\n # observation event indices that will be broadcast with the\n # last batch index in `observation_distribution`.\n observations = tf.expand_dims(\n observations,\n observation_rank - underlying_event_rank)\n observation_log_probs = self._observation_distribution.log_prob(\n observations)\n\n log_prob = log_init + observation_log_probs[0]\n\n if self._num_steps == 1:\n most_likely_end = tf.argmax(input=log_prob, axis=-1)\n return most_likely_end[..., tf.newaxis]\n\n def forward_step(previous_step_pair, log_prob_observation):\n log_prob_previous = previous_step_pair[0]\n log_prob = (log_prob_previous[..., tf.newaxis] +\n self._log_trans +\n 
log_prob_observation[..., tf.newaxis, :])\n most_likely_given_successor = tf.argmax(input=log_prob, axis=-2)\n max_log_p_given_successor = tf.reduce_max(input_tensor=log_prob,\n axis=-2)\n return (max_log_p_given_successor, most_likely_given_successor)\n\n forward_log_probs, all_most_likely_given_successor = tf.scan(\n forward_step,\n observation_log_probs[1:],\n initializer=(log_prob,\n tf.zeros(tf.shape(input=log_init), dtype=tf.int64)),\n name=\"forward_log_probs\")\n\n most_likely_end = tf.argmax(input=forward_log_probs[-1], axis=-1)\n\n # We require the operation that gives C from A and B where\n # C[i...j] = A[i...j, B[i...j]]\n # and A = most_likely_given_successor\n # B = most_likely_successor.\n # tf.gather requires indices of known shape so instead we use\n # reduction with tf.one_hot(B) to pick out elements from B\n def backward_step(most_likely_successor, most_likely_given_successor):\n return tf.reduce_sum(\n input_tensor=(most_likely_given_successor *\n tf.one_hot(most_likely_successor,\n self._num_states,\n dtype=tf.int64)),\n axis=-1)\n\n backward_scan = tf.scan(\n backward_step,\n all_most_likely_given_successor,\n most_likely_end,\n reverse=True)\n most_likely_sequences = tf.concat([backward_scan, [most_likely_end]],\n axis=0)\n return distribution_util.move_dimension(most_likely_sequences, 0, -1)", "response": "This function computes the maximum likelihood sequence of hidden states for a given set of observations."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef _choose_random_direction(current_state_parts, batch_rank, seed=None):\n seed_gen = distributions.SeedStream(seed, salt='_choose_random_direction')\n # Chooses the random directions across each of the input components.\n rnd_direction_parts = [\n tf.random.normal(\n tf.shape(input=current_state_part), dtype=tf.float32, seed=seed_gen())\n for current_state_part in current_state_parts\n ]\n\n # Sum squares over all of the 
input components. Note this takes all\n # components into account.\n sum_squares = sum(\n tf.reduce_sum(\n input_tensor=rnd_direction**2.,\n axis=tf.range(batch_rank, tf.rank(rnd_direction)),\n keepdims=True) for rnd_direction in rnd_direction_parts)\n\n # Normalizes the random direction fragments.\n rnd_direction_parts = [rnd_direction / tf.sqrt(sum_squares)\n for rnd_direction in rnd_direction_parts]\n\n return rnd_direction_parts", "response": "Chooses a random direction in the event space."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\napplying a single iteration of slice sampling update.", "response": "def _sample_next(target_log_prob_fn,\n current_state_parts,\n step_sizes,\n max_doublings,\n current_target_log_prob,\n batch_rank,\n seed=None,\n name=None):\n \"\"\"Applies a single iteration of slice sampling update.\n\n Applies hit and run style slice sampling. Chooses a uniform random direction\n on the unit sphere in the event space. Applies the one dimensional slice\n sampling update along that direction.\n\n Args:\n target_log_prob_fn: Python callable which takes an argument like\n `*current_state_parts` and returns its (possibly unnormalized) log-density\n under the target distribution.\n current_state_parts: Python `list` of `Tensor`s representing the current\n state(s) of the Markov chain(s). The first `independent_chain_ndims` of\n the `Tensor`(s) index different chains.\n step_sizes: Python `list` of `Tensor`s. Provides a measure of the width\n of the density. Used to find the slice bounds. Must broadcast with the\n shape of `current_state_parts`.\n max_doublings: Integer number of doublings to allow while locating the slice\n boundaries.\n current_target_log_prob: `Tensor` representing the value of\n `target_log_prob_fn(*current_state_parts)`. The only reason to specify\n this argument is to reduce TF graph size.\n batch_rank: Integer. 
The number of axes in the state that correspond to\n independent batches.\n seed: Python integer to seed random number generators.\n name: Python `str` name prefixed to Ops created by this function.\n Default value: `None` (i.e., 'find_slice_bounds').\n\n Returns:\n proposed_state_parts: Tensor or Python list of `Tensor`s representing the\n state(s) of the Markov chain(s) at each result step. Has same shape as\n input `current_state_parts`.\n proposed_target_log_prob: `Tensor` representing the value of\n `target_log_prob_fn` at `next_state`.\n bounds_satisfied: Boolean `Tensor` of the same shape as the log density.\n True indicates whether the an interval containing the slice for that\n batch was found successfully.\n direction: `Tensor` or Python list of `Tensors`s representing the direction\n along which the slice was sampled. Has the same shape and dtype(s) as\n `current_state_parts`.\n upper_bounds: `Tensor` of batch shape and the dtype of the input state. The\n upper bounds of the slices along the sampling direction.\n lower_bounds: `Tensor` of batch shape and the dtype of the input state. The\n lower bounds of the slices along the sampling direction.\n \"\"\"\n with tf.compat.v1.name_scope(name, 'sample_next', [\n current_state_parts, step_sizes, max_doublings, current_target_log_prob,\n batch_rank\n ]):\n # First step: Choose a random direction.\n # Direction is a list of tensors. The i'th tensor should have the same shape\n # as the i'th state part.\n direction = _choose_random_direction(current_state_parts,\n batch_rank=batch_rank,\n seed=seed)\n\n # Interpolates the step sizes for the chosen direction.\n # Applies an ellipsoidal interpolation to compute the step direction for\n # the chosen direction. Suppose we are given step sizes for each direction.\n # Label these s_1, s_2, ... s_k. These are the step sizes to use if moving\n # in a direction parallel to one of the axes. Consider an ellipsoid which\n # intercepts the i'th axis at s_i. 
The step size for a direction specified\n # by the unit vector (n_1, n_2 ...n_k) is then defined as the intersection\n # of the line through this vector with this ellipsoid.\n #\n # One can show that the length of the vector from the origin to the\n # intersection point is given by:\n # 1 / sqrt(n_1^2 / s_1^2 + n_2^2 / s_2^2 + ...).\n #\n # Proof:\n # The equation of the ellipsoid is:\n # Sum_i [x_i^2 / s_i^2 ] = 1. Let n be a unit direction vector. Points\n # along the line given by n may be parameterized as alpha*n where alpha is\n # the distance along the vector. Plugging this into the equation for the\n # ellipsoid, we get:\n # alpha^2 ( n_1^2 / s_1^2 + n_2^2 / s_2^2 + ...) = 1\n # so alpha = \\sqrt { \\frac{1} { ( n_1^2 / s_1^2 + n_2^2 / s_2^2 + ...) } }\n reduce_axes = [tf.range(batch_rank, tf.rank(dirn_part))\n for dirn_part in direction]\n\n components = [\n tf.reduce_sum(\n input_tensor=(dirn_part / step_size)**2, axis=reduce_axes[i])\n for i, (step_size, dirn_part) in enumerate(zip(step_sizes, direction))\n ]\n step_size = tf.math.rsqrt(tf.add_n(components))\n # Computes the rank of a tensor. Uses the static rank if possible.\n def _get_rank(x):\n return (len(x.shape.as_list()) if x.shape.dims is not None\n else tf.rank(x))\n state_part_ranks = [_get_rank(part) for part in current_state_parts]\n\n def _step_along_direction(alpha):\n \"\"\"Converts the scalar alpha into an n-dim vector with full state info.\n\n Computes x_0 + alpha * direction where x_0 is the current state and\n direction is the direction chosen above.\n\n Args:\n alpha: A tensor of shape equal to the batch dimensions of\n `current_state_parts`.\n\n Returns:\n state_parts: Tensor or Python list of `Tensor`s representing the\n state(s) of the Markov chain(s) for a given alpha and a given chosen\n direction. 
Has the same shape as `current_state_parts`.\n \"\"\"\n padded_alphas = [_right_pad(alpha, final_rank=part_rank)\n for part_rank in state_part_ranks]\n\n state_parts = [state_part + padded_alpha\n * direction_part for state_part, direction_part,\n padded_alpha in\n zip(current_state_parts, direction, padded_alphas)]\n return state_parts\n\n def projected_target_log_prob_fn(alpha):\n \"\"\"The target log density projected along the chosen direction.\n\n Args:\n alpha: A tensor of shape equal to the batch dimensions of\n `current_state_parts`.\n\n Returns:\n Target log density evaluated at x_0 + alpha * direction where x_0 is the\n current state and direction is the direction chosen above. Has the same\n shape as `alpha`.\n \"\"\"\n return target_log_prob_fn(*_step_along_direction(alpha))\n\n alpha_init = tf.zeros_like(current_target_log_prob,\n dtype=current_state_parts[0].dtype.base_dtype)\n [\n next_alpha,\n next_target_log_prob,\n bounds_satisfied,\n upper_bounds,\n lower_bounds\n ] = ssu.slice_sampler_one_dim(projected_target_log_prob_fn,\n x_initial=alpha_init,\n max_doublings=max_doublings,\n step_size=step_size, seed=seed)\n return [\n _step_along_direction(next_alpha),\n next_target_log_prob,\n bounds_satisfied,\n direction,\n upper_bounds,\n lower_bounds\n ]"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _right_pad(x, final_rank):\n padded_shape = tf.concat(\n [tf.shape(input=x),\n tf.ones(final_rank - tf.rank(x), dtype=tf.int32)],\n axis=0)\n static_padded_shape = None\n if x.shape.is_fully_defined() and isinstance(final_rank, int):\n static_padded_shape = x.shape.as_list()\n extra_dims = final_rank - len(static_padded_shape)\n static_padded_shape.extend([1] * extra_dims)\n\n padded_x = tf.reshape(x, static_padded_shape or padded_shape)\n return padded_x", "response": "Pads the shape of x to the right to be of rank final_rank."} {"SOURCE": "codesearchnet", "instruction": "How would you 
explain what the following Python 3 function does\ndef _prepare_args(target_log_prob_fn, state, step_size,\n target_log_prob=None, maybe_expand=False,\n description='target_log_prob'):\n \"\"\"Processes input args to meet list-like assumptions.\"\"\"\n state_parts = list(state) if mcmc_util.is_list_like(state) else [state]\n state_parts = [\n tf.convert_to_tensor(value=s, name='current_state') for s in state_parts\n ]\n\n target_log_prob = _maybe_call_fn(\n target_log_prob_fn,\n state_parts,\n target_log_prob,\n description)\n step_sizes = (list(step_size) if mcmc_util.is_list_like(step_size)\n else [step_size])\n step_sizes = [\n tf.convert_to_tensor(\n value=s, name='step_size', dtype=target_log_prob.dtype)\n for s in step_sizes\n ]\n if len(step_sizes) == 1:\n step_sizes *= len(state_parts)\n if len(state_parts) != len(step_sizes):\n raise ValueError('There should be exactly one `step_size` or it should '\n 'have same length as `current_state`.')\n def maybe_flatten(x):\n return x if maybe_expand or mcmc_util.is_list_like(state) else x[0]\n return [\n maybe_flatten(state_parts),\n maybe_flatten(step_sizes),\n target_log_prob\n ]", "response": "Processes input args to meet list - like assumptions."} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\nrun one iteration of Slice Sampler.", "response": "def one_step(self, current_state, previous_kernel_results):\n \"\"\"Runs one iteration of Slice Sampler.\n\n Args:\n current_state: `Tensor` or Python `list` of `Tensor`s representing the\n current state(s) of the Markov chain(s). 
The first `r` dimensions\n index independent chains,\n `r = tf.rank(target_log_prob_fn(*current_state))`.\n previous_kernel_results: `collections.namedtuple` containing `Tensor`s\n representing values from previous calls to this function (or from the\n `bootstrap_results` function.)\n\n Returns:\n next_state: Tensor or Python list of `Tensor`s representing the state(s)\n of the Markov chain(s) after taking exactly one step. Has same type and\n shape as `current_state`.\n kernel_results: `collections.namedtuple` of internal calculations used to\n advance the chain.\n\n Raises:\n ValueError: if there isn't one `step_size` or a list with same length as\n `current_state`.\n TypeError: if `not target_log_prob.dtype.is_floating`.\n \"\"\"\n with tf.compat.v1.name_scope(\n name=mcmc_util.make_name(self.name, 'slice', 'one_step'),\n values=[\n self.step_size, self.max_doublings, self._seed_stream,\n current_state, previous_kernel_results.target_log_prob\n ]):\n with tf.compat.v1.name_scope('initialize'):\n [\n current_state_parts,\n step_sizes,\n current_target_log_prob\n ] = _prepare_args(\n self.target_log_prob_fn,\n current_state,\n self.step_size,\n previous_kernel_results.target_log_prob,\n maybe_expand=True)\n\n max_doublings = tf.convert_to_tensor(\n value=self.max_doublings, dtype=tf.int32, name='max_doublings')\n\n independent_chain_ndims = distribution_util.prefer_static_rank(\n current_target_log_prob)\n\n [\n next_state_parts,\n next_target_log_prob,\n bounds_satisfied,\n direction,\n upper_bounds,\n lower_bounds\n ] = _sample_next(\n self.target_log_prob_fn,\n current_state_parts,\n step_sizes,\n max_doublings,\n current_target_log_prob,\n independent_chain_ndims,\n seed=self._seed_stream()\n )\n\n def maybe_flatten(x):\n return x if mcmc_util.is_list_like(current_state) else x[0]\n\n return [\n maybe_flatten(next_state_parts),\n SliceSamplerKernelResults(\n target_log_prob=next_target_log_prob,\n bounds_satisfied=bounds_satisfied,\n direction=direction,\n 
upper_bounds=upper_bounds,\n lower_bounds=lower_bounds\n ),\n ]"} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\ninitialize from a uniform - 2 distribution in unconstrained space.", "response": "def sample_uniform_initial_state(parameter,\n return_constrained=True,\n init_sample_shape=(),\n seed=None):\n \"\"\"Initialize from a uniform [-2, 2] distribution in unconstrained space.\n\n Args:\n parameter: `sts.Parameter` named tuple instance.\n return_constrained: if `True`, re-applies the constraining bijector\n to return initializations in the original domain. Otherwise, returns\n initializations in the unconstrained space.\n Default value: `True`.\n init_sample_shape: `sample_shape` of the sampled initializations.\n Default value: `[]`.\n seed: Python integer to seed the random number generator.\n\n Returns:\n uniform_initializer: `Tensor` of shape `concat([init_sample_shape,\n parameter.prior.batch_shape, transformed_event_shape])`, where\n `transformed_event_shape` is `parameter.prior.event_shape`, if\n `return_constrained=True`, and otherwise it is\n `parameter.bijector.inverse_event_shape(parameteter.prior.event_shape)`.\n \"\"\"\n unconstrained_prior_sample = parameter.bijector.inverse(\n parameter.prior.sample(init_sample_shape, seed=seed))\n uniform_initializer = 4 * tf.random.uniform(\n tf.shape(input=unconstrained_prior_sample),\n dtype=unconstrained_prior_sample.dtype,\n seed=seed) - 2\n if return_constrained:\n return parameter.bijector.forward(uniform_initializer)\n else:\n return uniform_initializer"} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\nbuilds a transformed - normal variational distribution over a parameter s support.", "response": "def _build_trainable_posterior(param, initial_loc_fn):\n \"\"\"Built a transformed-normal variational dist over a parameter's support.\"\"\"\n loc = tf.compat.v1.get_variable(\n param.name + '_loc',\n initializer=lambda: 
initial_loc_fn(param),\n dtype=param.prior.dtype,\n use_resource=True)\n scale = tf.nn.softplus(\n tf.compat.v1.get_variable(\n param.name + '_scale',\n initializer=lambda: -4 * tf.ones_like(initial_loc_fn(param)),\n dtype=param.prior.dtype,\n use_resource=True))\n\n q = tfd.Normal(loc=loc, scale=scale)\n\n # Ensure the `event_shape` of the variational distribution matches the\n # parameter.\n if (param.prior.event_shape.ndims is None\n or param.prior.event_shape.ndims > 0):\n q = tfd.Independent(\n q, reinterpreted_batch_ndims=param.prior.event_shape.ndims)\n\n # Transform to constrained parameter space.\n return tfd.TransformedDistribution(q, param.bijector)"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\nbuild a loss function for a variational inference in STS models.", "response": "def build_factored_variational_loss(model,\n observed_time_series,\n init_batch_shape=(),\n seed=None,\n name=None):\n \"\"\"Build a loss function for variational inference in STS models.\n\n Variational inference searches for the distribution within some family of\n approximate posteriors that minimizes a divergence between the approximate\n posterior `q(z)` and true posterior `p(z|observed_time_series)`. By converting\n inference to optimization, it's generally much faster than sampling-based\n inference algorithms such as HMC. The tradeoff is that the approximating\n family rarely contains the true posterior, so it may miss important aspects of\n posterior structure (in particular, dependence between variables) and should\n not be blindly trusted. 
Results may vary; it's generally wise to compare to\n HMC to evaluate whether inference quality is sufficient for your task at hand.\n\n This method constructs a loss function for variational inference using the\n Kullback-Liebler divergence `KL[q(z) || p(z|observed_time_series)]`, with an\n approximating family given by independent Normal distributions transformed to\n the appropriate parameter space for each parameter. Minimizing this loss (the\n negative ELBO) maximizes a lower bound on the log model evidence `-log\n p(observed_time_series)`. This is equivalent to the 'mean-field' method\n implemented in [1]. and is a standard approach. The resulting posterior\n approximations are unimodal; they will tend to underestimate posterior\n uncertainty when the true posterior contains multiple modes (the `KL[q||p]`\n divergence encourages choosing a single mode) or dependence between variables.\n\n Args:\n model: An instance of `StructuralTimeSeries` representing a\n time-series model. This represents a joint distribution over\n time-series and their parameters with batch shape `[b1, ..., bN]`.\n observed_time_series: `float` `Tensor` of shape\n `concat([sample_shape, model.batch_shape, [num_timesteps, 1]]) where\n `sample_shape` corresponds to i.i.d. observations, and the trailing `[1]`\n dimension may (optionally) be omitted if `num_timesteps > 1`. May\n optionally be an instance of `tfp.sts.MaskedTimeSeries`, which includes\n a mask `Tensor` to specify timesteps with missing observations.\n init_batch_shape: Batch shape (Python `tuple`, `list`, or `int`) of initial\n states to optimize in parallel.\n Default value: `()`. 
(i.e., just run a single optimization).\n seed: Python integer to seed the random number generator.\n name: Python `str` name prefixed to ops created by this function.\n Default value: `None` (i.e., 'build_factored_variational_loss').\n\n Returns:\n variational_loss: `float` `Tensor` of shape\n `concat([init_batch_shape, model.batch_shape])`, encoding a stochastic\n estimate of an upper bound on the negative model evidence `-log p(y)`.\n Minimizing this loss performs variational inference; the gap between the\n variational bound and the true (generally unknown) model evidence\n corresponds to the divergence `KL[q||p]` between the approximate and true\n posterior.\n variational_distributions: `collections.OrderedDict` giving\n the approximate posterior for each model parameter. The keys are\n Python `str` parameter names in order, corresponding to\n `[param.name for param in model.parameters]`. The values are\n `tfd.Distribution` instances with batch shape\n `concat([init_batch_shape, model.batch_shape])`; these will typically be\n of the form `tfd.TransformedDistribution(tfd.Normal(...),\n bijector=param.bijector)`.\n\n #### Examples\n\n Assume we've built a structural time-series model:\n\n ```python\n day_of_week = tfp.sts.Seasonal(\n num_seasons=7,\n observed_time_series=observed_time_series,\n name='day_of_week')\n local_linear_trend = tfp.sts.LocalLinearTrend(\n observed_time_series=observed_time_series,\n name='local_linear_trend')\n model = tfp.sts.Sum(components=[day_of_week, local_linear_trend],\n observed_time_series=observed_time_series)\n ```\n\n To run variational inference, we simply construct the loss and optimize\n it:\n\n ```python\n (variational_loss,\n variational_distributions) = tfp.sts.build_factored_variational_loss(\n model=model, observed_time_series=observed_time_series)\n\n train_op = tf.train.AdamOptimizer(0.1).minimize(variational_loss)\n with tf.Session() as sess:\n sess.run(tf.global_variables_initializer())\n\n for step in 
range(200):\n _, loss_ = sess.run((train_op, variational_loss))\n\n if step % 20 == 0:\n print(\"step {} loss {}\".format(step, loss_))\n\n posterior_samples_ = sess.run({\n param_name: q.sample(50)\n for param_name, q in variational_distributions.items()})\n ```\n\n As a more complex example, we might try to avoid local optima by optimizing\n from multiple initializations in parallel, and selecting the result with the\n lowest loss:\n\n ```python\n (variational_loss,\n variational_distributions) = tfp.sts.build_factored_variational_loss(\n model=model, observed_time_series=observed_time_series,\n init_batch_shape=[10])\n\n train_op = tf.train.AdamOptimizer(0.1).minimize(variational_loss)\n with tf.Session() as sess:\n sess.run(tf.global_variables_initializer())\n\n for step in range(200):\n _, loss_ = sess.run((train_op, variational_loss))\n\n if step % 20 == 0:\n print(\"step {} losses {}\".format(step, loss_))\n\n # Draw multiple samples to reduce Monte Carlo error in the optimized\n # variational bounds.\n avg_loss = np.mean(\n [sess.run(variational_loss) for _ in range(25)], axis=0)\n best_posterior_idx = np.argmin(avg_loss, axis=0).astype(np.int32)\n ```\n\n #### References\n\n [1]: Alp Kucukelbir, Dustin Tran, Rajesh Ranganath, Andrew Gelman, and\n David M. Blei. Automatic Differentiation Variational Inference. 
In\n _Journal of Machine Learning Research_, 2017.\n https://arxiv.org/abs/1603.00788\n\n \"\"\"\n\n with tf.compat.v1.name_scope(\n name, 'build_factored_variational_loss',\n values=[observed_time_series]) as name:\n seed = tfd.SeedStream(\n seed, salt='StructuralTimeSeries_build_factored_variational_loss')\n\n variational_distributions = collections.OrderedDict()\n variational_samples = []\n for param in model.parameters:\n def initial_loc_fn(param):\n return sample_uniform_initial_state(\n param, return_constrained=True,\n init_sample_shape=init_batch_shape,\n seed=seed())\n q = _build_trainable_posterior(param, initial_loc_fn=initial_loc_fn)\n variational_distributions[param.name] = q\n variational_samples.append(q.sample(seed=seed()))\n\n # Multiple initializations (similar to HMC chains) manifest as an extra\n # param batch dimension, so we need to add corresponding batch dimension(s)\n # to `observed_time_series`.\n observed_time_series = sts_util.pad_batch_dimension_for_multiple_chains(\n observed_time_series, model, chain_batch_shape=init_batch_shape)\n\n # Construct the variational bound.\n log_prob_fn = model.joint_log_prob(observed_time_series)\n expected_log_joint = log_prob_fn(*variational_samples)\n entropy = tf.reduce_sum(\n input_tensor=[\n -q.log_prob(sample) for (q, sample) in zip(\n variational_distributions.values(), variational_samples)\n ],\n axis=0)\n variational_loss = -(expected_log_joint + entropy) # -ELBO\n\n return variational_loss, variational_distributions"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef _minimize_in_graph(build_loss_fn, num_steps=200, optimizer=None):\n optimizer = tf.compat.v1.train.AdamOptimizer(\n 0.1) if optimizer is None else optimizer\n\n def train_loop_body(step):\n train_op = optimizer.minimize(\n build_loss_fn if tf.executing_eagerly() else build_loss_fn())\n return tf.tuple(tensors=[tf.add(step, 1)], control_inputs=[train_op])\n\n minimize_op = 
tf.compat.v1.while_loop(\n cond=lambda step: step < num_steps,\n body=train_loop_body,\n loop_vars=[tf.constant(0)],\n return_same_structure=True)[0] # Always return a single op.\n return minimize_op", "response": "Run an optimizer within the graph to minimize a loss function."} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\nfits a model to the posterior distribution using the Markov chain Monte Carlo method.", "response": "def fit_with_hmc(model,\n observed_time_series,\n num_results=100,\n num_warmup_steps=50,\n num_leapfrog_steps=15,\n initial_state=None,\n initial_step_size=None,\n chain_batch_shape=(),\n num_variational_steps=150,\n variational_optimizer=None,\n seed=None,\n name=None):\n \"\"\"Draw posterior samples using Hamiltonian Monte Carlo (HMC).\n\n Markov chain Monte Carlo (MCMC) methods are considered the gold standard of\n Bayesian inference; under suitable conditions and in the limit of infinitely\n many draws they generate samples from the true posterior distribution. HMC [1]\n uses gradients of the model's log-density function to propose samples,\n allowing it to exploit posterior geometry. However, it is computationally more\n expensive than variational inference and relatively sensitive to tuning.\n\n This method attempts to provide a sensible default approach for fitting\n StructuralTimeSeries models using HMC. It first runs variational inference as\n a fast posterior approximation, and initializes the HMC sampler from the\n variational posterior, using the posterior standard deviations to set\n per-variable step sizes (equivalently, a diagonal mass matrix). During the\n warmup phase, it adapts the step size to target an acceptance rate of 0.75,\n which is thought to be in the desirable range for optimal mixing [2].\n\n\n Args:\n model: An instance of `StructuralTimeSeries` representing a\n time-series model. 
This represents a joint distribution over\n time-series and their parameters with batch shape `[b1, ..., bN]`.\n observed_time_series: `float` `Tensor` of shape\n `concat([sample_shape, model.batch_shape, [num_timesteps, 1]]) where\n `sample_shape` corresponds to i.i.d. observations, and the trailing `[1]`\n dimension may (optionally) be omitted if `num_timesteps > 1`. May\n optionally be an instance of `tfp.sts.MaskedTimeSeries`, which includes\n a mask `Tensor` to specify timesteps with missing observations.\n num_results: Integer number of Markov chain draws.\n Default value: `100`.\n num_warmup_steps: Integer number of steps to take before starting to\n collect results. The warmup steps are also used to adapt the step size\n towards a target acceptance rate of 0.75.\n Default value: `50`.\n num_leapfrog_steps: Integer number of steps to run the leapfrog integrator\n for. Total progress per HMC step is roughly proportional to\n `step_size * num_leapfrog_steps`.\n Default value: `15`.\n initial_state: Optional Python `list` of `Tensor`s, one for each model\n parameter, representing the initial state(s) of the Markov chain(s). These\n should have shape `concat([chain_batch_shape, param.prior.batch_shape,\n param.prior.event_shape])`. If `None`, the initial state is set\n automatically using a sample from a variational posterior.\n Default value: `None`.\n initial_step_size: Python `list` of `Tensor`s, one for each model parameter,\n representing the step size for the leapfrog integrator. Must\n broadcast with the shape of `initial_state`. Larger step sizes lead to\n faster progress, but too-large step sizes make rejection exponentially\n more likely. 
If `None`, the step size is set automatically using the\n standard deviation of a variational posterior.\n Default value: `None`.\n chain_batch_shape: Batch shape (Python `tuple`, `list`, or `int`) of chains\n to run in parallel.\n Default value: `[]` (i.e., a single chain).\n num_variational_steps: Python `int` number of steps to run the variational\n optimization to determine the initial state and step sizes.\n Default value: `150`.\n variational_optimizer: Optional `tf.train.Optimizer` instance to use in\n the variational optimization. If `None`, defaults to\n `tf.train.AdamOptimizer(0.1)`.\n Default value: `None`.\n seed: Python integer to seed the random number generator.\n name: Python `str` name prefixed to ops created by this function.\n Default value: `None` (i.e., 'fit_with_hmc').\n\n Returns:\n samples: Python `list` of `Tensors` representing posterior samples of model\n parameters, with shapes `[concat([[num_results], chain_batch_shape,\n param.prior.batch_shape, param.prior.event_shape]) for param in\n model.parameters]`.\n kernel_results: A (possibly nested) `tuple`, `namedtuple` or `list` of\n `Tensor`s representing internal calculations made within the HMC sampler.\n\n #### Examples\n\n Assume we've built a structural time-series model:\n\n ```python\n day_of_week = tfp.sts.Seasonal(\n num_seasons=7,\n observed_time_series=observed_time_series,\n name='day_of_week')\n local_linear_trend = tfp.sts.LocalLinearTrend(\n observed_time_series=observed_time_series,\n name='local_linear_trend')\n model = tfp.sts.Sum(components=[day_of_week, local_linear_trend],\n observed_time_series=observed_time_series)\n ```\n\n To draw posterior samples using HMC under default settings:\n\n ```python\n samples, kernel_results = tfp.sts.fit_with_hmc(model, observed_time_series)\n\n with tf.Session() as sess:\n sess.run(tf.global_variables_initializer())\n samples_, kernel_results_ = sess.run((samples, kernel_results))\n\n print(\"acceptance rate: {}\".format(\n 
np.mean(kernel_results_.inner_results.is_accepted, axis=0)))\n print(\"posterior means: {}\".format(\n {param.name: np.mean(param_draws, axis=0)\n for (param, param_draws) in zip(model.parameters, samples_)}))\n ```\n\n We can also run multiple chains. This may help diagnose convergence issues\n and allows us to exploit vectorization to draw samples more quickly, although\n warmup still requires the same number of sequential steps.\n\n ```python\n from matplotlib import pylab as plt\n\n samples, kernel_results = tfp.sts.fit_with_hmc(\n model, observed_time_series, chain_batch_shape=[10])\n\n with tf.Session() as sess:\n sess.run(tf.global_variables_initializer())\n samples_, kernel_results_ = sess.run((samples, kernel_results))\n\n print(\"acceptance rate: {}\".format(\n np.mean(kernel_results_.inner_results.inner_results.is_accepted, axis=0)))\n\n # Plot the sampled traces for each parameter. If the chains have mixed, their\n # traces should all cover the same region of state space, frequently crossing\n # over each other.\n for (param, param_draws) in zip(model.parameters, samples_):\n if param.prior.event_shape.ndims > 0:\n print(\"Only plotting traces for scalar parameters, skipping {}\".format(\n param.name))\n continue\n plt.figure(figsize=[10, 4])\n plt.title(param.name)\n plt.plot(param_draws)\n plt.ylabel(param.name)\n plt.xlabel(\"HMC step\")\n\n # Combining the samples from multiple chains into a single dimension allows\n # us to easily pass sampled parameters to downstream forecasting methods.\n combined_samples_ = [np.reshape(param_draws,\n [-1] + list(param_draws.shape[2:]))\n for param_draws in samples_]\n ```\n\n For greater flexibility, you may prefer to implement your own sampler using\n the TensorFlow Probability primitives in `tfp.mcmc`. 
The following recipe\n constructs a basic HMC sampler, using a `TransformedTransitionKernel` to\n incorporate constraints on the parameter space.\n\n ```python\n transformed_hmc_kernel = mcmc.TransformedTransitionKernel(\n inner_kernel=mcmc.SimpleStepSizeAdaptation(\n inner_kernel=mcmc.HamiltonianMonteCarlo(\n target_log_prob_fn=model.joint_log_prob(observed_time_series),\n step_size=step_size,\n num_leapfrog_steps=num_leapfrog_steps,\n state_gradients_are_stopped=True,\n seed=seed),\n num_adaptation_steps=int(0.8 * num_warmup_steps)),\n bijector=[param.bijector for param in model.parameters])\n\n # Initialize from a Uniform[-2, 2] distribution in unconstrained space.\n initial_state = [tfp.sts.sample_uniform_initial_state(\n param, return_constrained=True) for param in model.parameters]\n\n samples, kernel_results = tfp.mcmc.sample_chain(\n kernel=transformed_hmc_kernel,\n num_results=num_results,\n current_state=initial_state,\n num_burnin_steps=num_warmup_steps)\n ```\n\n #### References\n\n [1]: Radford Neal. MCMC Using Hamiltonian Dynamics. _Handbook of Markov Chain\n Monte Carlo_, 2011. https://arxiv.org/abs/1206.1901\n [2]: M.J. Betancourt, Simon Byrne, and Mark Girolami. Optimizing The\n Integrator Step Size for Hamiltonian Monte Carlo.\n https://arxiv.org/abs/1411.6669\n\n \"\"\"\n with tf.compat.v1.name_scope(\n name, 'fit_with_hmc', values=[observed_time_series]) as name:\n seed = tfd.SeedStream(seed, salt='StructuralTimeSeries_fit_with_hmc')\n\n # Initialize state and step sizes from a variational posterior if not\n # specified.\n if initial_step_size is None or initial_state is None:\n\n # To avoid threading variational distributions through the training\n # while loop, we build our own copy here. 
`make_template` ensures\n # that our variational distributions share the optimized parameters.\n def make_variational():\n return build_factored_variational_loss(\n model, observed_time_series,\n init_batch_shape=chain_batch_shape, seed=seed())\n\n make_variational = tf.compat.v1.make_template('make_variational',\n make_variational)\n _, variational_distributions = make_variational()\n minimize_op = _minimize_in_graph(\n build_loss_fn=lambda: make_variational()[0], # return just the loss.\n num_steps=num_variational_steps,\n optimizer=variational_optimizer)\n\n with tf.control_dependencies([minimize_op]):\n if initial_state is None:\n initial_state = [tf.stop_gradient(d.sample())\n for d in variational_distributions.values()]\n\n # Set step sizes using the unconstrained variational distribution.\n if initial_step_size is None:\n initial_step_size = [\n transformed_q.distribution.stddev()\n for transformed_q in variational_distributions.values()]\n\n # Multiple chains manifest as an extra param batch dimension, so we need to\n # add a corresponding batch dimension to `observed_time_series`.\n observed_time_series = sts_util.pad_batch_dimension_for_multiple_chains(\n observed_time_series, model, chain_batch_shape=chain_batch_shape)\n\n # Run HMC to sample from the posterior on parameters.\n samples, kernel_results = mcmc.sample_chain(\n num_results=num_results,\n current_state=initial_state,\n num_burnin_steps=num_warmup_steps,\n kernel=mcmc.SimpleStepSizeAdaptation(\n inner_kernel=mcmc.TransformedTransitionKernel(\n inner_kernel=mcmc.HamiltonianMonteCarlo(\n target_log_prob_fn=model.joint_log_prob(\n observed_time_series),\n step_size=initial_step_size,\n num_leapfrog_steps=num_leapfrog_steps,\n state_gradients_are_stopped=True,\n seed=seed()),\n bijector=[param.bijector for param in model.parameters]),\n num_adaptation_steps=int(num_warmup_steps * 0.8),\n adaptation_rate=tf.convert_to_tensor(\n value=0.1, dtype=initial_state[0].dtype)),\n parallel_iterations=1 if 
seed is not None else 10)\n\n return samples, kernel_results"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef moments_of_masked_time_series(time_series_tensor, broadcast_mask):\n num_unmasked_entries = tf.cast(\n tf.reduce_sum(input_tensor=tf.cast(~broadcast_mask, tf.int32), axis=-1),\n time_series_tensor.dtype)\n\n # Manually compute mean and variance, excluding masked entries.\n mean = (tf.reduce_sum(input_tensor=tf.where(\n broadcast_mask,\n tf.zeros_like(time_series_tensor),\n time_series_tensor), axis=-1) / num_unmasked_entries)\n variance = (tf.reduce_sum(input_tensor=tf.where(\n broadcast_mask,\n tf.zeros_like(time_series_tensor),\n (time_series_tensor - mean[..., tf.newaxis]) ** 2), axis=-1)\n / num_unmasked_entries)\n return mean, variance", "response": "Compute mean and variance accounting for a mask."} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\nreturns the initial value of the first unmasked entry of each time series.", "response": "def initial_value_of_masked_time_series(time_series_tensor, broadcast_mask):\n \"\"\"Get the first unmasked entry of each time series in the batch.\n\n Args:\n time_series_tensor: float `Tensor` of shape [..., num_timesteps].\n broadcast_mask: bool `Tensor` of same shape as `time_series`.\n \"\"\"\n\n num_timesteps = tf.shape(input=time_series_tensor)[-1]\n\n # Compute the index of the first unmasked entry for each series in the batch.\n unmasked_negindices = (\n tf.cast(~broadcast_mask, tf.int32) *\n tf.range(num_timesteps, 0, -1))\n first_unmasked_indices = num_timesteps - tf.reduce_max(\n input_tensor=unmasked_negindices, axis=-1)\n\n if first_unmasked_indices.shape.ndims is None:\n raise NotImplementedError(\n 'Cannot compute initial values of a masked time series with '\n 'dynamic rank.') # `batch_gather` requires static rank\n\n # Extract the initial value for each series in the batch.\n return 
tf.squeeze(tf.compat.v1.batch_gather(\n params=time_series_tensor,\n indices=first_unmasked_indices[..., tf.newaxis]), axis=-1)"} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\nget broadcast batch shape from distributions.", "response": "def broadcast_batch_shape(distributions):\n \"\"\"Get broadcast batch shape from distributions, statically if possible.\"\"\"\n\n # Static case\n batch_shape = distributions[0].batch_shape\n for distribution in distributions:\n batch_shape = tf.broadcast_static_shape(batch_shape,\n distribution.batch_shape)\n if batch_shape.is_fully_defined():\n return batch_shape.as_list()\n\n # Fallback on dynamic.\n batch_shape = distributions[0].batch_shape_tensor()\n for distribution in distributions:\n batch_shape = tf.broadcast_dynamic_shape(batch_shape,\n distribution.batch_shape_tensor())\n\n return tf.convert_to_tensor(value=batch_shape)"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef pad_batch_dimension_for_multiple_chains(\n observed_time_series, model, chain_batch_shape):\n \"\"\"Expand the observed time series with extra batch dimension(s).\"\"\"\n # Running with multiple chains introduces an extra batch dimension. In\n # general we also need to pad the observed time series with a matching batch\n # dimension.\n #\n # For example, suppose our model has batch shape [3, 4] and\n # the observed time series has shape `concat([[5], [3, 4], [100]])`,\n # corresponding to `sample_shape`, `batch_shape`, and `num_timesteps`\n # respectively. 
The model will produce distributions with batch shape\n # `concat([chain_batch_shape, [3, 4]])`, so we pad `observed_time_series` to\n # have matching shape `[5, 1, 3, 4, 100]`, where the added `1` dimension\n # between the sample and batch shapes will broadcast to `chain_batch_shape`.\n\n [ # Extract mask and guarantee `event_ndims=2`.\n observed_time_series,\n is_missing\n ] = canonicalize_observed_time_series_with_mask(observed_time_series)\n\n event_ndims = 2 # event_shape = [num_timesteps, observation_size=1]\n\n model_batch_ndims = (\n model.batch_shape.ndims if model.batch_shape.ndims is not None else\n tf.shape(input=model.batch_shape_tensor())[0])\n\n # Compute ndims from chain_batch_shape.\n chain_batch_shape = tf.convert_to_tensor(\n value=chain_batch_shape, name='chain_batch_shape', dtype=tf.int32)\n if not chain_batch_shape.shape.is_fully_defined():\n raise ValueError('Batch shape must have static rank. (given: {})'.format(\n chain_batch_shape))\n if chain_batch_shape.shape.ndims == 0: # expand int `k` to `[k]`.\n chain_batch_shape = chain_batch_shape[tf.newaxis]\n chain_batch_ndims = tf.compat.dimension_value(chain_batch_shape.shape[0])\n\n def do_padding(observed_time_series_tensor):\n current_sample_shape = tf.shape(\n input=observed_time_series_tensor)[:-(model_batch_ndims + event_ndims)]\n current_batch_and_event_shape = tf.shape(\n input=observed_time_series_tensor)[-(model_batch_ndims + event_ndims):]\n return tf.reshape(\n tensor=observed_time_series_tensor,\n shape=tf.concat([\n current_sample_shape,\n tf.ones([chain_batch_ndims], dtype=tf.int32),\n current_batch_and_event_shape], axis=0))\n\n # Padding is only needed if the observed time series has sample shape.\n observed_time_series = prefer_static.cond(\n (dist_util.prefer_static_rank(observed_time_series) >\n model_batch_ndims + event_ndims),\n lambda: do_padding(observed_time_series),\n lambda: observed_time_series)\n\n if is_missing is not None:\n is_missing = prefer_static.cond(\n 
(dist_util.prefer_static_rank(is_missing) >\n model_batch_ndims + event_ndims),\n lambda: do_padding(is_missing),\n lambda: is_missing)\n return missing_values_util.MaskedTimeSeries(observed_time_series,\n is_missing=is_missing)\n\n return observed_time_series", "response": "Expand the observed time series with extra batch dimension(s)."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef factored_joint_mvn(distributions):\n\n graph_parents = [tensor for distribution in distributions\n for tensor in distribution._graph_parents] # pylint: disable=protected-access\n with tf.compat.v1.name_scope('factored_joint_mvn', values=graph_parents):\n\n # We explicitly broadcast the `locs` so that we can concatenate them.\n # We don't have direct numerical access to the `scales`, which are arbitrary\n # linear operators, but `LinearOperatorBlockDiag` appears to do the right\n # thing without further intervention.\n dtype = tf.debugging.assert_same_float_dtype(distributions)\n broadcast_ones = tf.ones(broadcast_batch_shape(distributions),\n dtype=dtype)[..., tf.newaxis]\n return MultivariateNormalLinearOperator(\n loc=tf.concat([mvn.mean() * broadcast_ones for mvn in distributions],\n axis=-1),\n scale=tfl.LinearOperatorBlockDiag([mvn.scale for mvn in distributions],\n is_square=True))", "response": "Combine a list of multivariate normal distributions into a factored multivariate normal linear operator."} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\nattempts to sum MultivariateNormal distributions. The sum of (multivariate) normal random variables is itself (multivariate) normal, with mean given by the sum of means and (co)variance given by the sum of (co)variances. This method exploits this fact to compute the sum of a list of `tfd.MultivariateNormalDiag` objects. 
It may in the future be extended to support summation of other forms of (Multivariate)Normal distributions. Args: distributions: Python `iterable` of `tfd.MultivariateNormalDiag` distribution instances. These must all have the same event shape, and broadcast to a consistent batch shape. Returns: sum_distribution: A `tfd.MultivariateNormalDiag` instance with mean equal to the sum of input means and covariance equal to the sum of input covariances.", "response": "def sum_mvns(distributions):\n \"\"\"Attempt to sum MultivariateNormal distributions.\n\n The sum of (multivariate) normal random variables is itself (multivariate)\n normal, with mean given by the sum of means and (co)variance given by the\n sum of (co)variances. This method exploits this fact to compute the\n sum of a list of `tfd.MultivariateNormalDiag` objects.\n\n It may in the future be extended to support summation of other forms of\n (Multivariate)Normal distributions.\n\n Args:\n distributions: Python `iterable` of `tfd.MultivariateNormalDiag`\n distribution instances. These must all have the same event\n shape, and broadcast to a consistent batch shape.\n\n Returns:\n sum_distribution: A `tfd.MultivariateNormalDiag` instance with mean\n equal to the sum of input means and covariance equal to the sum of\n input covariances.\n \"\"\"\n\n graph_parents = [tensor for distribution in distributions\n for tensor in distribution._graph_parents] # pylint: disable=protected-access\n with tf.compat.v1.name_scope('sum_mvns', values=graph_parents):\n if all([isinstance(mvn, tfd.MultivariateNormalDiag)\n for mvn in distributions]):\n return tfd.MultivariateNormalDiag(\n loc=sum([mvn.mean() for mvn in distributions]),\n scale_diag=tf.sqrt(sum([\n mvn.scale.diag**2 for mvn in distributions])))\n else:\n raise NotImplementedError(\n 'Sums of distributions other than MultivariateNormalDiag are not '\n 'currently implemented. 
(given: {})'.format(distributions))"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef empirical_statistics(observed_time_series):\n\n with tf.compat.v1.name_scope(\n 'empirical_statistics', values=[observed_time_series]):\n\n [\n observed_time_series,\n mask\n ] = canonicalize_observed_time_series_with_mask(observed_time_series)\n\n squeezed_series = observed_time_series[..., 0]\n if mask is None:\n observed_mean, observed_variance = tf.nn.moments(\n x=squeezed_series, axes=-1)\n observed_initial = squeezed_series[..., 0]\n else:\n broadcast_mask = tf.broadcast_to(tf.cast(mask, tf.bool),\n tf.shape(input=squeezed_series))\n observed_mean, observed_variance = (\n missing_values_util.moments_of_masked_time_series(\n squeezed_series, broadcast_mask=broadcast_mask))\n try:\n observed_initial = (\n missing_values_util.initial_value_of_masked_time_series(\n squeezed_series, broadcast_mask=broadcast_mask))\n except NotImplementedError:\n tf.compat.v1.logging.warn(\n 'Cannot compute initial values for a masked time series '\n 'with dynamic shape; using the mean instead. 
This will '\n 'affect heuristic priors and may change the results of '\n 'inference.')\n observed_initial = observed_mean\n\n observed_stddev = tf.sqrt(observed_variance)\n observed_initial_centered = observed_initial - observed_mean\n return observed_mean, observed_stddev, observed_initial_centered", "response": "Compute statistics of a provided time series as heuristic initialization."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef _maybe_expand_trailing_dim(observed_time_series_tensor):\n\n with tf.compat.v1.name_scope(\n 'maybe_expand_trailing_dim', values=[observed_time_series_tensor]):\n if (observed_time_series_tensor.shape.ndims is not None and\n tf.compat.dimension_value(\n observed_time_series_tensor.shape[-1]) is not None):\n expanded_time_series = (\n observed_time_series_tensor\n if observed_time_series_tensor.shape[-1] == 1\n else observed_time_series_tensor[..., tf.newaxis])\n else:\n expanded_time_series = tf.cond(\n pred=tf.equal(tf.shape(input=observed_time_series_tensor)[-1], 1),\n true_fn=lambda: observed_time_series_tensor,\n false_fn=lambda: observed_time_series_tensor[..., tf.newaxis])\n return expanded_time_series", "response": "Ensures that the observed_time_series_tensor has a trailing dimension of size 1."} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef canonicalize_observed_time_series_with_mask(\n maybe_masked_observed_time_series):\n \"\"\"Extract a Tensor with canonical shape and optional mask.\n\n Args:\n maybe_masked_observed_time_series: a `Tensor`-like object with shape\n `[..., num_timesteps]` or `[..., num_timesteps, 1]`, or a\n `tfp.sts.MaskedTimeSeries` containing such an object.\n Returns:\n masked_time_series: a `tfp.sts.MaskedTimeSeries` namedtuple, in which\n the `observed_time_series` is converted to `Tensor` with canonical shape\n `[..., num_timesteps, 1]`, and `is_missing` is 
either `None` or a boolean\n `Tensor`.\n \"\"\"\n\n with tf.compat.v1.name_scope('canonicalize_observed_time_series_with_mask'):\n if hasattr(maybe_masked_observed_time_series, 'is_missing'):\n observed_time_series = (\n maybe_masked_observed_time_series.time_series)\n is_missing = maybe_masked_observed_time_series.is_missing\n else:\n observed_time_series = maybe_masked_observed_time_series\n is_missing = None\n\n observed_time_series = tf.convert_to_tensor(value=observed_time_series,\n name='observed_time_series')\n observed_time_series = _maybe_expand_trailing_dim(observed_time_series)\n\n if is_missing is not None:\n is_missing = tf.convert_to_tensor(\n value=is_missing, name='is_missing', dtype_hint=tf.bool)\n\n return missing_values_util.MaskedTimeSeries(observed_time_series,\n is_missing=is_missing)", "response": "Returns a Tensor with canonical shape and optional mask."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef mix_over_posterior_draws(means, variances):\n # The inputs `means`, `variances` have shape\n # `concat([\n # [num_posterior_draws],\n # sample_shape,\n # batch_shape,\n # [num_timesteps]])`\n # Because MixtureSameFamily mixes over the rightmost batch dimension,\n # we need to move the `num_posterior_draws` dimension to be rightmost\n # in the batch shape. 
This requires use of `Independent` (to preserve\n # `num_timesteps` as part of the event shape) and `move_dimension`.\n # TODO(b/120245392): enhance `MixtureSameFamily` to reduce along an\n # arbitrary axis, and eliminate `move_dimension` calls here.\n\n with tf.compat.v1.name_scope(\n 'mix_over_posterior_draws', values=[means, variances]):\n num_posterior_draws = dist_util.prefer_static_value(\n tf.shape(input=means))[0]\n\n component_observations = tfd.Independent(\n distribution=tfd.Normal(\n loc=dist_util.move_dimension(means, 0, -2),\n scale=tf.sqrt(dist_util.move_dimension(variances, 0, -2))),\n reinterpreted_batch_ndims=1)\n\n return tfd.MixtureSameFamily(\n mixture_distribution=tfd.Categorical(\n logits=tf.zeros([num_posterior_draws],\n dtype=component_observations.dtype)),\n components_distribution=component_observations)", "response": "Construct a predictive normal distribution that mixes over posterior draws."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _kl_uniform_uniform(a, b, name=None):\n with tf.name_scope(name or \"kl_uniform_uniform\"):\n # Consistent with\n # http://www.mast.queensu.ca/~communications/Papers/gil-msc11.pdf, page 60\n # Watch out for the change in conventions--they use 'a' and 'b' to refer to\n # lower and upper bounds respectively there.\n final_batch_shape = distribution_util.get_broadcast_shape(\n a.low, b.low, a.high, b.high)\n dtype = dtype_util.common_dtype(\n [a.low, a.high, b.low, b.high], tf.float32)\n return tf.where((b.low <= a.low) & (a.high <= b.high),\n tf.math.log(b.high - b.low) - tf.math.log(a.high - a.low),\n tf.broadcast_to(\n dtype_util.as_numpy_dtype(dtype)(np.inf),\n final_batch_shape))", "response": "Calculate the batched KL divergence of a and b uniform distribution."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _make_summary_statistic(attr):\n def _fn(self):\n if 
any(self._dist_fn_args): # pylint: disable=protected-access\n raise ValueError(\n 'Can only compute ' + attr + ' when all distributions are '\n 'independent; {}'.format(self.model))\n return self._unflatten(getattr(d(), attr)() for d in self._dist_fn_wrapped) # pylint: disable=protected-access\n return _fn", "response": "Factory for making summary statistics, e.g., mean, mode, stddev."} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\nunifying the call signature of dist_fn with all previous nodes.", "response": "def _unify_call_signature(i, dist_fn):\n \"\"\"Creates `dist_fn_wrapped` which calls `dist_fn` with all prev nodes.\n\n Args:\n i: Python `int` corresponding to position in topologically sorted DAG.\n dist_fn: Python `callable` which takes a subset of previously constructed\n distributions (in reverse order) and produces a new distribution instance.\n\n Returns:\n dist_fn_wrapped: Python `callable` which takes all previous distributions\n (in non reverse order) and produces a new distribution instance.\n args: `tuple` of `str` representing the arg names of `dist_fn` (and in non\n wrapped, \"natural\" order). 
`None` is returned only if the input is not a\n `callable`.\n \"\"\"\n if distribution_util.is_distribution_instance(dist_fn):\n return (lambda *_: dist_fn), None\n\n if not callable(dist_fn):\n raise TypeError('{} must be either `tfd.Distribution`-like or '\n '`callable`.'.format(dist_fn))\n\n args = _get_required_args(dist_fn)\n if not args:\n return (lambda *_: dist_fn()), ()\n\n @functools.wraps(dist_fn)\n def dist_fn_wrapped(*xs):\n \"\"\"Calls `dist_fn` with reversed and truncated args.\"\"\"\n if i != len(xs):\n raise ValueError(\n 'Internal Error: Unexpected number of inputs provided to {}-th '\n 'distribution maker (dist_fn: {}, expected: {}, saw: {}).'.format(\n i, dist_fn, i, len(xs)))\n if len(xs) < len(args):\n raise ValueError(\n 'Internal Error: Too few inputs provided to {}-th distribution maker '\n '(dist_fn: {}, expected: {}, saw: {}).'.format(\n i, dist_fn, len(args), len(xs)))\n return dist_fn(*reversed(xs[-len(args):]))\n return dist_fn_wrapped, args"} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\nuse arg names to resolve distribution names.", "response": "def _resolve_distribution_names(dist_fn_args, dist_names, leaf_name):\n \"\"\"Uses arg names to resolve distribution names.\"\"\"\n if dist_names is None:\n dist_names = []\n else:\n dist_names = dist_names.copy()\n n = len(dist_fn_args)\n dist_names.extend([None]*(n - len(dist_names)))\n for i_, args in enumerate(reversed(dist_fn_args)):\n if not args:\n continue # There's no args to analyze.\n i = n - i_ - 1\n for j, arg_name in enumerate(args):\n dist_names[i - j - 1] = arg_name\n j = 0\n for i_ in range(len(dist_names)):\n i = n - i_ - 1\n if dist_names[i] is None:\n dist_names[i] = leaf_name if j == 0 else leaf_name + str(j)\n j += 1\n return tuple(dist_names)"} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\nreturns the distribution s required args.", "response": "def _get_required_args(fn):\n 
\"\"\"Returns the distribution's required args.\"\"\"\n argspec = tf_inspect.getfullargspec(fn)\n args = argspec.args\n if tf_inspect.isclass(fn):\n args = args[1:] # Remove the `self` arg.\n if argspec.defaults:\n # Remove the args which have defaults. By convention we only feed\n # *required args*. This means some distributions must always be wrapped\n # with a `lambda`, e.g., `lambda logits: tfd.Bernoulli(logits=logits)`\n # or `lambda probs: tfd.Bernoulli(probs=probs)`.\n args = args[:-len(argspec.defaults)]\n return tuple(args)"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\ncomputing the KL divergence between two joint distributions.", "response": "def _kl_joint_joint(d0, d1, name=None):\n \"\"\"Calculate the KL divergence between two `JointDistributionSequential`s.\n\n Args:\n d0: instance of a `JointDistributionSequential` object.\n d1: instance of a `JointDistributionSequential` object.\n name: (optional) Name to use for created operations.\n Default value: `\"kl_joint_joint\"`.\n\n Returns:\n kl_joint_joint: `Tensor` The sum of KL divergences between elemental\n distributions of two joint distributions.\n\n Raises:\n ValueError: when joint distributions have a different number of elemental\n distributions.\n ValueError: when either joint distribution has a distribution with dynamic\n dependency, i.e., when either joint distribution is not a collection of\n independent distributions.\n \"\"\"\n if len(d0._dist_fn_wrapped) != len(d1._dist_fn_wrapped): # pylint: disable=protected-access\n raise ValueError(\n 'Can only compute KL divergence when each joint distribution has the '\n 'same number of component distributions.')\n if (not all(a is None for a in d0._dist_fn_args) or # pylint: disable=protected-access\n not all(a is None for a in d1._dist_fn_args)): # pylint: disable=protected-access\n raise ValueError(\n 'Can only compute KL divergence when all distributions are '\n 'independent.')\n with tf.name_scope(name or 
'kl_jointseq_jointseq'):\n return sum(kullback_leibler.kl_divergence(d0_(), d1_())\n for d0_, d1_ in zip(d0._dist_fn_wrapped, d1._dist_fn_wrapped))"} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function for\ncreating dist_fn dist_fn_wrapped dist_fn_args.", "response": "def _build(self, model):\n \"\"\"Creates `dist_fn`, `dist_fn_wrapped`, `dist_fn_args`.\"\"\"\n if not isinstance(model, collections.Sequence):\n raise TypeError('`model` must be `list`-like (saw: {}).'.format(\n type(model).__name__))\n self._dist_fn = model\n self._dist_fn_wrapped, self._dist_fn_args = zip(*[\n _unify_call_signature(i, dist_fn)\n for i, dist_fn in enumerate(model)])"} {"SOURCE": "codesearchnet", "instruction": "Can you create a Python 3 function that\nresolves the graph of the given distribution_names and leaf_name.", "response": "def _resolve_graph(self, distribution_names=None, leaf_name='x'):\n \"\"\"Creates a `tuple` of `tuple`s of dependencies.\n\n This function is **experimental**. That said, we encourage its use\n and ask that you report problems to `tfprobability@tensorflow.org`.\n\n Args:\n distribution_names: `list` of `str` or `None` names corresponding to each\n of `model` elements. 
(`None`s are expanded into the\n appropriate `str`.)\n leaf_name: `str` used when no maker depends on a particular\n `model` element.\n\n Returns:\n graph: `tuple` of `(str tuple)` pairs representing the name of each\n distribution (maker) and the names of its dependencies.\n\n #### Example\n\n ```python\n d = tfd.JointDistributionSequential([\n tfd.Independent(tfd.Exponential(rate=[100, 120]), 1),\n lambda e: tfd.Gamma(concentration=e[..., 0], rate=e[..., 1]),\n tfd.Normal(loc=0, scale=2.),\n lambda n, g: tfd.Normal(loc=n, scale=g),\n ])\n d._resolve_graph()\n # ==> (\n # ('e', ()),\n # ('g', ('e',)),\n # ('n', ()),\n # ('x', ('n', 'g')),\n # )\n ```\n\n \"\"\"\n # This function additionally depends on:\n # self._dist_fn_args\n # self._dist_fn_wrapped\n # TODO(b/129008220): Robustify this procedure. Eg, handle collisions better,\n # ignore args prefixed with `_`.\n if distribution_names is None or any(self._dist_fn_args):\n distribution_names = _resolve_distribution_names(\n self._dist_fn_args, distribution_names, leaf_name)\n if len(set(distribution_names)) != len(distribution_names):\n raise ValueError('Distribution names must be unique: {}'.format(\n distribution_names))\n if len(distribution_names) != len(self._dist_fn_wrapped):\n raise ValueError('Distribution names must be 1:1 with `rvs`.')\n return tuple(zip(distribution_names,\n tuple(() if a is None else a for a in self._dist_fn_args)))"} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\ncompute the Shannon entropy in nats.", "response": "def _entropy(self):\n \"\"\"Shannon entropy in nats.\"\"\"\n if any(self._dist_fn_args):\n raise ValueError(\n 'Can only compute entropy when all distributions are independent.')\n return sum(joint_distribution_lib.maybe_check_wont_broadcast(\n (d().entropy() for d in self._dist_fn_wrapped),\n self.validate_args))"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef 
_extend_support_with_default_value(self, x, f, default_value):\n with tf.name_scope(\"extend_support_with_default_value\"):\n x = tf.convert_to_tensor(value=x, dtype=self.dtype, name=\"x\")\n loc = self.loc + tf.zeros_like(self.scale) + tf.zeros_like(x)\n x = x + tf.zeros_like(loc)\n # Substitute out-of-support values in x with values that are in the\n # support of the distribution before applying f.\n y = f(tf.where(x < loc, self._inv_z(0.5) + tf.zeros_like(x), x))\n if default_value == 0.:\n default_value = tf.zeros_like(y)\n elif default_value == 1.:\n default_value = tf.ones_like(y)\n else:\n default_value = tf.fill(\n dims=tf.shape(input=y),\n value=dtype_util.as_numpy_dtype(self.dtype)(default_value))\n return tf.where(x < loc, default_value, y)", "response": "Extends the support of this distribution with a default value."} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\nprocess input args to meet list - like assumptions.", "response": "def _prepare_args(log_likelihood_fn, state,\n log_likelihood=None, description='log_likelihood'):\n \"\"\"Processes input args to meet list-like assumptions.\"\"\"\n state_parts = list(state) if mcmc_util.is_list_like(state) else [state]\n state_parts = [tf.convert_to_tensor(s, name='current_state')\n for s in state_parts]\n\n log_likelihood = _maybe_call_fn(\n log_likelihood_fn,\n state_parts,\n log_likelihood,\n description)\n return [state_parts, log_likelihood]"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef one_step(self, current_state, previous_kernel_results):\n with tf.compat.v1.name_scope(\n name=mcmc_util.make_name(self.name, 'elliptical_slice', 'one_step'),\n values=[self._seed_stream,\n current_state,\n previous_kernel_results.log_likelihood]):\n with tf.compat.v1.name_scope('initialize'):\n [\n init_state_parts,\n init_log_likelihood\n ] = _prepare_args(\n self.log_likelihood_fn,\n current_state,\n 
previous_kernel_results.log_likelihood)\n\n normal_samples = self.normal_sampler_fn(self._seed_stream()) # pylint: disable=not-callable\n normal_samples = list(normal_samples) if mcmc_util.is_list_like(\n normal_samples) else [normal_samples]\n u = tf.random.uniform(\n shape=tf.shape(init_log_likelihood),\n seed=self._seed_stream(),\n dtype=init_log_likelihood.dtype.base_dtype,\n )\n threshold = init_log_likelihood + tf.math.log(u)\n\n starting_angle = tf.random.uniform(\n shape=tf.shape(init_log_likelihood),\n minval=0.,\n maxval=2 * np.pi,\n name='angle',\n seed=self._seed_stream(),\n dtype=init_log_likelihood.dtype.base_dtype,\n )\n starting_angle_min = starting_angle - 2 * np.pi\n starting_angle_max = starting_angle\n\n starting_state_parts = _rotate_on_ellipse(\n init_state_parts, normal_samples, starting_angle)\n starting_log_likelihood = self.log_likelihood_fn(*starting_state_parts) # pylint: disable=not-callable\n\n def chain_not_done(\n angle,\n angle_min,\n angle_max,\n current_state_parts,\n current_log_likelihood):\n del angle, angle_min, angle_max, current_state_parts\n return tf.reduce_any(current_log_likelihood < threshold)\n\n def sample_next_angle(\n angle,\n angle_min,\n angle_max,\n current_state_parts,\n current_log_likelihood):\n \"\"\"Slice sample a new angle, and rotate init_state by that amount.\"\"\"\n chain_not_done = current_log_likelihood < threshold\n # Box in on angle. 
Only update angles for which we haven't generated a\n # point that beats the threshold.\n angle_min = tf.where(\n tf.math.logical_and(angle < 0, chain_not_done),\n angle,\n angle_min)\n angle_max = tf.where(\n tf.math.logical_and(angle >= 0, chain_not_done),\n angle,\n angle_max)\n new_angle = tf.random.uniform(\n shape=tf.shape(current_log_likelihood),\n minval=angle_min,\n maxval=angle_max,\n seed=self._seed_stream(),\n dtype=angle.dtype.base_dtype\n )\n angle = tf.where(chain_not_done, new_angle, angle)\n next_state_parts = _rotate_on_ellipse(\n init_state_parts, normal_samples, angle)\n\n new_state_parts = []\n broadcasted_chain_not_done = _right_pad_with_ones(\n chain_not_done, tf.rank(next_state_parts[0]))\n for n_state, c_state in zip(next_state_parts, current_state_parts):\n new_state_part = tf.where(\n tf.broadcast_to(\n broadcasted_chain_not_done,\n tf.shape(n_state)),\n n_state,\n c_state)\n new_state_parts.append(new_state_part)\n\n return (\n angle,\n angle_min,\n angle_max,\n new_state_parts,\n self.log_likelihood_fn(*new_state_parts) # pylint: disable=not-callable\n )\n\n [\n next_angle,\n _,\n _,\n next_state_parts,\n next_log_likelihood,\n ] = tf.while_loop(\n cond=chain_not_done,\n body=sample_next_angle,\n loop_vars=[\n starting_angle,\n starting_angle_min,\n starting_angle_max,\n starting_state_parts,\n starting_log_likelihood\n ])\n\n return [\n next_state_parts if mcmc_util.is_list_like(\n current_state) else next_state_parts[0],\n EllipticalSliceSamplerKernelResults(\n log_likelihood=next_log_likelihood,\n angle=next_angle,\n normal_samples=normal_samples,\n ),\n ]", "response": "Runs one iteration of the Elliptical Slice Sampler."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef image_summary(seqs, name, num=None):\n seqs = tf.clip_by_value(seqs, 0., 1.)\n seqs = tf.unstack(seqs[:num])\n joined_seqs = [tf.concat(tf.unstack(seq), 1) for seq in seqs]\n joined_seqs = 
tf.expand_dims(tf.concat(joined_seqs, 0), 0)\n tf.compat.v2.summary.image(\n name,\n joined_seqs,\n max_outputs=1,\n step=tf.compat.v1.train.get_or_create_global_step())", "response": "Visualizes sequences as TensorBoard summaries."} {"SOURCE": "codesearchnet", "instruction": "Can you create a Python 3 function that\nvisualizes the reconstruction of inputs in TensorBoard.", "response": "def visualize_reconstruction(inputs, reconstruct, num=3, name=\"reconstruction\"):\n \"\"\"Visualizes the reconstruction of inputs in TensorBoard.\n\n Args:\n inputs: A tensor of the original inputs, of shape [batch, timesteps,\n h, w, c].\n reconstruct: A tensor of a reconstruction of inputs, of shape\n [batch, timesteps, h, w, c].\n num: Integer for the number of examples to visualize.\n name: String name of this summary.\n \"\"\"\n reconstruct = tf.clip_by_value(reconstruct, 0., 1.)\n inputs_and_reconstruct = tf.concat((inputs[:num], reconstruct[:num]), axis=0)\n image_summary(inputs_and_reconstruct, name)"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\nvisualizing a qualitative analysis of a given model.", "response": "def visualize_qualitative_analysis(inputs, model, samples=1, batch_size=3,\n length=8):\n \"\"\"Visualizes a qualitative analysis of a given model.\n\n Args:\n inputs: A tensor of the original inputs, of shape [batch, timesteps,\n h, w, c].\n model: A DisentangledSequentialVAE model.\n samples: Number of samples to draw from the latent distributions.\n batch_size: Number of sequences to generate.\n length: Number of timesteps to generate for each sequence.\n \"\"\"\n average = lambda dist: tf.reduce_mean(\n input_tensor=dist.mean(), axis=0) # avg over samples\n with tf.compat.v1.name_scope(\"val_reconstruction\"):\n reconstruct = functools.partial(model.reconstruct, inputs=inputs,\n samples=samples)\n visualize_reconstruction(inputs, average(reconstruct()))\n visualize_reconstruction(inputs, average(reconstruct(sample_static=True)),\n 
name=\"static_prior\")\n visualize_reconstruction(inputs, average(reconstruct(sample_dynamic=True)),\n name=\"dynamic_prior\")\n visualize_reconstruction(inputs, average(reconstruct(swap_static=True)),\n name=\"swap_static\")\n visualize_reconstruction(inputs, average(reconstruct(swap_dynamic=True)),\n name=\"swap_dynamic\")\n\n with tf.compat.v1.name_scope(\"generation\"):\n generate = functools.partial(model.generate, batch_size=batch_size,\n length=length, samples=samples)\n image_summary(average(generate(fix_static=True)), \"fix_static\")\n image_summary(average(generate(fix_dynamic=True)), \"fix_dynamic\")"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef summarize_dist_params(dist, name, name_scope=\"dist_params\"):\n with tf.compat.v1.name_scope(name_scope):\n tf.compat.v2.summary.histogram(\n name=\"{}/{}\".format(name, \"mean\"),\n data=dist.mean(),\n step=tf.compat.v1.train.get_or_create_global_step())\n tf.compat.v2.summary.histogram(\n name=\"{}/{}\".format(name, \"stddev\"),\n data=dist.stddev(),\n step=tf.compat.v1.train.get_or_create_global_step())", "response": "Summarize the parameters of a distribution."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef summarize_mean_in_nats_and_bits(inputs, units, name,\n nats_name_scope=\"nats\",\n bits_name_scope=\"bits_per_dim\"):\n \"\"\"Summarize the mean of a tensor in nats and bits per unit.\n\n Args:\n inputs: A tensor of values measured in nats.\n units: The units of the tensor with which to compute the mean bits\n per unit.\n name: The name of the tensor.\n nats_name_scope: The name scope of the nats summary.\n bits_name_scope: The name scope of the bits summary.\n \"\"\"\n mean = tf.reduce_mean(input_tensor=inputs)\n with tf.compat.v1.name_scope(nats_name_scope):\n tf.compat.v2.summary.scalar(\n name,\n mean,\n step=tf.compat.v1.train.get_or_create_global_step())\n with 
tf.compat.v1.name_scope(bits_name_scope):\n tf.compat.v2.summary.scalar(\n name,\n mean / units / tf.math.log(2.),\n step=tf.compat.v1.train.get_or_create_global_step())", "response": "Summarize the mean of a tensor in nats and bits per unit."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\nrun the model to generate multivariate normal distribution.", "response": "def call(self, inputs):\n \"\"\"Runs the model to generate multivariate normal distribution.\n\n Args:\n inputs: Unused.\n\n Returns:\n A MultivariateNormalDiag distribution with event shape\n [dimensions], batch shape [], and sample shape [sample_shape,\n dimensions].\n \"\"\"\n del inputs # unused\n with tf.compat.v1.name_scope(self._name):\n return tfd.MultivariateNormalDiag(self.loc, self.scale_diag)"} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef zero_state(self, sample_batch_shape=()):\n h0 = tf.zeros([1, self.hidden_size])\n c0 = tf.zeros([1, self.hidden_size])\n combined_shape = tf.concat((tf.convert_to_tensor(\n value=sample_batch_shape, dtype=tf.int32), [self.dimensions]),\n axis=-1)\n previous_output = tf.zeros(combined_shape)\n return previous_output, (h0, c0)", "response": "Returns an initial state for the LSTM cell."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef call(self, inputs, state):\n # In order to allow the user to pass in a single example without a batch\n # dimension, we always expand the input to at least two dimensions, then\n # fix the output shape to remove the batch dimension if necessary.\n original_shape = inputs.shape\n if len(original_shape) < 2:\n inputs = tf.reshape(inputs, [1, -1])\n out, state = self.lstm_cell(inputs, state)\n out = self.output_layer(out)\n correct_shape = tf.concat((original_shape[:-1], tf.shape(input=out)[-1:]),\n 0)\n out = tf.reshape(out, correct_shape)\n loc = out[..., 
:self.dimensions]\n scale_diag = tf.nn.softplus(out[..., self.dimensions:]) + 1e-5 # keep > 0\n return tfd.MultivariateNormalDiag(loc=loc, scale_diag=scale_diag), state", "response": "Runs the model to generate a batched MultivariateNormalDiag distribution for a single timestep."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\nrunning the model to generate a set of IndependentDistributions over the pixels of x_t.", "response": "def call(self, inputs):\n \"\"\"Runs the model to generate a distribution p(x_t | z_t, f).\n\n Args:\n inputs: A tuple of (z_{1:T}, f), where `z_{1:T}` is a tensor of\n shape [..., batch_size, timesteps, latent_size_dynamic], and `f`\n is of shape [..., batch_size, latent_size_static].\n\n Returns:\n A batched Independent distribution wrapping a set of Normal\n distributions over the pixels of x_t, where the Independent\n distribution has event shape [height, width, channels], batch\n shape [batch_size, timesteps], and sample shape [sample_shape,\n batch_size, timesteps, height, width, channels].\n \"\"\"\n # We explicitly broadcast f to the same shape as z other than the final\n # dimension, because `tf.concat` can't automatically do this.\n dynamic, static = inputs\n timesteps = tf.shape(input=dynamic)[-2]\n static = static[..., tf.newaxis, :] + tf.zeros([timesteps, 1])\n latents = tf.concat([dynamic, static], axis=-1) # (sample, N, T, latents)\n out = self.dense(latents)\n out = tf.reshape(out, (-1, 1, 1, self.hidden_size))\n out = self.conv_transpose1(out)\n out = self.conv_transpose2(out)\n out = self.conv_transpose3(out)\n out = self.conv_transpose4(out) # (sample*N*T, h, w, c)\n expanded_shape = tf.concat(\n (tf.shape(input=latents)[:-1], tf.shape(input=out)[1:]), axis=0)\n out = tf.reshape(out, expanded_shape) # (sample, N, T, h, w, c)\n return tfd.Independent(\n distribution=tfd.Normal(loc=out, scale=1.),\n reinterpreted_batch_ndims=3, # wrap (h, w, c)\n name=\"decoded_image\")"} {"SOURCE": "codesearchnet", 
"instruction": "Implement a Python 3 function for\nrunning the model to generate an intermediate representation of x_t.", "response": "def call(self, inputs):\n \"\"\"Runs the model to generate an intermediate representation of x_t.\n\n Args:\n inputs: A batch of image sequences `x_{1:T}` of shape\n `[sample_shape, batch_size, timesteps, height, width,\n channels]`.\n\n Returns:\n A batch of intermediate representations of shape [sample_shape,\n batch_size, timesteps, hidden_size].\n \"\"\"\n image_shape = tf.shape(input=inputs)[-3:]\n collapsed_shape = tf.concat(([-1], image_shape), axis=0)\n out = tf.reshape(inputs, collapsed_shape) # (sample*batch*T, h, w, c)\n out = self.conv1(out)\n out = self.conv2(out)\n out = self.conv3(out)\n out = self.conv4(out)\n expanded_shape = tf.concat((tf.shape(input=inputs)[:-3], [-1]), axis=0)\n return tf.reshape(out, expanded_shape)"} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\nruns the model to generate a list of MultivariateNormalDiag distributions using the output of the recurrent model at each timestep.", "response": "def call(self, inputs):\n \"\"\"Runs the model to generate a distribution `q(f | x_{1:T})`.\n\n This generates a list of batched MultivariateNormalDiag\n distributions using the output of the recurrent model at each\n timestep to parameterize each distribution.\n\n Args:\n inputs: A batch of intermediate representations of image frames\n across all timesteps, of shape [..., batch_size, timesteps,\n hidden_size].\n\n Returns:\n A batched MultivariateNormalDiag distribution with event shape\n [latent_size], batch shape [..., batch_size], and sample shape\n [sample_shape, ..., batch_size, latent_size].\n \"\"\"\n # TODO(dusenberrymw): Remove these reshaping commands after b/113126249 is\n # fixed.\n collapsed_shape = tf.concat(([-1], tf.shape(input=inputs)[-2:]), axis=0)\n out = tf.reshape(inputs, collapsed_shape) # (sample*batch_size, T, hidden)\n out = 
self.bilstm(out) # (sample*batch_size, hidden)\n expanded_shape = tf.concat((tf.shape(input=inputs)[:-2], [-1]), axis=0)\n out = tf.reshape(out, expanded_shape) # (sample, batch_size, hidden)\n out = self.output_layer(out) # (sample, batch_size, 2*latent_size)\n loc = out[..., :self.latent_size]\n scale_diag = tf.nn.softplus(out[..., self.latent_size:]) + 1e-5 # keep > 0\n return tfd.MultivariateNormalDiag(loc=loc, scale_diag=scale_diag)"} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef call(self, inputs):\n out = self.dense(inputs) # (..., batch, time, hidden)\n out = self.output_layer(out) # (..., batch, time, 2*latent)\n loc = out[..., :self.latent_size]\n scale_diag = tf.nn.softplus(out[..., self.latent_size:]) + 1e-5 # keep > 0\n return tfd.MultivariateNormalDiag(loc=loc, scale_diag=scale_diag)", "response": "Runs the model to generate a MultivariateNormalDiag distribution."} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\nrun the model to generate a MultivariateNormalDiag .", "response": "def call(self, inputs):\n \"\"\"Runs the model to generate a distribution `q(z_{1:T} | x_{1:T}, f)`.\n\n This generates a list of batched MultivariateNormalDiag\n distributions using the output of the recurrent model at each\n timestep to parameterize each distribution.\n\n Args:\n inputs: A tuple of a batch of intermediate representations of\n image frames across all timesteps of shape [..., batch_size,\n timesteps, dimensions], and a sample of the static latent\n variable `f` of shape [..., batch_size, latent_size].\n\n Returns:\n A batch of MultivariateNormalDiag distributions with event shape\n [latent_size], batch shape [broadcasted_shape, batch_size,\n timesteps], and sample shape [sample_shape, broadcasted_shape,\n batch_size, timesteps, latent_size], where `broadcasted_shape` is\n the broadcasted sampled shape between the inputs and static\n sample.\n 
\"\"\"\n # We explicitly broadcast `x` and `f` to the same shape other than the final\n # dimension, because `tf.concat` can't automatically do this. This will\n # entail adding a `timesteps` dimension to `f` to give the shape `(...,\n # batch, timesteps, latent)`, and then broadcasting the sample shapes of\n # both tensors to the same shape.\n features, static_sample = inputs\n length = tf.shape(input=features)[-2]\n static_sample = static_sample[..., tf.newaxis, :] + tf.zeros([length, 1])\n sample_shape_static = tf.shape(input=static_sample)[:-3]\n sample_shape_inputs = tf.shape(input=features)[:-3]\n broadcast_shape_inputs = tf.concat((sample_shape_static, [1, 1, 1]), 0)\n broadcast_shape_static = tf.concat((sample_shape_inputs, [1, 1, 1]), 0)\n features = features + tf.zeros(broadcast_shape_inputs)\n static_sample = static_sample + tf.zeros(broadcast_shape_static)\n # `combined` will have shape (..., batch, T, hidden+latent).\n combined = tf.concat((features, static_sample), axis=-1)\n # TODO(dusenberrymw): Remove these reshaping commands after b/113126249 is\n # fixed.\n collapsed_shape = tf.concat(([-1], tf.shape(input=combined)[-2:]), axis=0)\n out = tf.reshape(combined, collapsed_shape)\n out = self.bilstm(out) # (sample*batch, T, hidden_size)\n out = self.rnn(out) # (sample*batch, T, hidden_size)\n expanded_shape = tf.concat(\n (tf.shape(input=combined)[:-2], tf.shape(input=out)[1:]), axis=0)\n out = tf.reshape(out, expanded_shape) # (sample, batch, T, hidden_size)\n out = self.output_layer(out) # (sample, batch, T, 2*latent_size)\n loc = out[..., :self.latent_size]\n scale_diag = tf.nn.softplus(out[..., self.latent_size:]) + 1e-5 # keep > 0\n return tfd.MultivariateNormalDiag(loc=loc, scale_diag=scale_diag)"} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\ngenerate a new set of normalised independent distributions.", "response": "def generate(self, batch_size, length, samples=1, fix_static=False,\n fix_dynamic=False):\n 
\"\"\"Generate new sequences.\n\n Args:\n batch_size: Number of sequences to generate.\n length: Number of timesteps to generate for each sequence.\n samples: Number of samples to draw from the latent distributions.\n fix_static: Boolean for whether or not to share the same random\n sample of the static latent variable `f` from its prior across\n all examples.\n fix_dynamic: Boolean for whether or not to share the same random\n sample of the dynamic latent variable `z_{1:T}` from its prior\n across all examples.\n\n Returns:\n A batched Independent distribution wrapping a set of Normal\n distributions over the pixels of the generated sequences, where\n the Independent distribution has event shape [height, width,\n channels], batch shape [samples, batch_size, timesteps], and\n sample shape [sample_shape, samples, batch_size, timesteps,\n height, width, channels].\n \"\"\"\n static_sample, _ = self.sample_static_prior(samples, batch_size, fix_static)\n dynamic_sample, _ = self.sample_dynamic_prior(samples, batch_size, length,\n fix_dynamic)\n likelihood = self.decoder((dynamic_sample, static_sample))\n return likelihood"} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef reconstruct(self, inputs, samples=1, sample_static=False,\n sample_dynamic=False, swap_static=False, swap_dynamic=False,\n fix_static=False, fix_dynamic=False):\n \"\"\"Reconstruct the given input sequences.\n\n Args:\n inputs: A batch of image sequences `x_{1:T}` of shape\n `[batch_size, timesteps, height, width, channels]`.\n samples: Number of samples to draw from the latent distributions.\n sample_static: Boolean for whether or not to randomly sample the\n static latent variable `f` from its prior distribution.\n sample_dynamic: Boolean for whether or not to randomly sample the\n dynamic latent variable `z_{1:T}` from its prior distribution.\n swap_static: Boolean for whether or not to swap the encodings for\n the static latent 
variable `f` between the examples.\n swap_dynamic: Boolean for whether or not to swap the encodings for\n the dynamic latent variable `z_{1:T}` between the examples.\n fix_static: Boolean for whether or not to share the same random\n sample of the static latent variable `f` from its prior across\n all examples.\n fix_dynamic: Boolean for whether or not to share the same random\n sample of the dynamic latent variable `z_{1:T}` from its prior\n across all examples.\n\n Returns:\n A batched Independent distribution wrapping a set of Normal\n distributions over the pixels of the reconstruction of the input,\n where the Independent distribution has event shape [height, width,\n channels], batch shape [samples, batch_size, timesteps], and\n sample shape [sample_shape, samples, batch_size, timesteps,\n height, width, channels].\n \"\"\"\n batch_size = tf.shape(input=inputs)[-5]\n length = len(tf.unstack(inputs, axis=-4)) # hack for graph mode\n\n features = self.compressor(inputs) # (..., batch, timesteps, hidden)\n\n if sample_static:\n static_sample, _ = self.sample_static_prior(\n samples, batch_size, fix_static)\n else:\n static_sample, _ = self.sample_static_posterior(features, samples)\n\n if swap_static:\n static_sample = tf.reverse(static_sample, axis=[1])\n\n if sample_dynamic:\n dynamic_sample, _ = self.sample_dynamic_prior(\n samples, batch_size, length, fix_dynamic)\n else:\n dynamic_sample, _ = self.sample_dynamic_posterior(\n features, samples, static_sample)\n\n if swap_dynamic:\n dynamic_sample = tf.reverse(dynamic_sample, axis=[1])\n\n likelihood = self.decoder((dynamic_sample, static_sample))\n return likelihood", "response": "Reconstructs the given input sequences."} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\nsample the static prior.", "response": "def sample_static_prior(self, samples, batch_size, fixed=False):\n \"\"\"Sample the static latent prior.\n\n Args:\n samples: Number of samples to draw from the latent 
distribution.\n batch_size: Number of sequences to sample.\n fixed: Boolean for whether or not to share the same random\n sample across all sequences.\n\n Returns:\n A tuple of a sample tensor of shape [samples, batch_size,\n latent_size], and a MultivariateNormalDiag distribution from which\n the tensor was sampled, with event shape [latent_size], and batch\n shape [].\n \"\"\"\n dist = self.static_prior()\n if fixed: # in either case, shape is (samples, batch, latent)\n sample = dist.sample((samples, 1)) + tf.zeros([batch_size, 1])\n else:\n sample = dist.sample((samples, batch_size))\n return sample, dist"} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\nsamples the static latent posterior.", "response": "def sample_static_posterior(self, inputs, samples):\n \"\"\"Sample the static latent posterior.\n\n Args:\n inputs: A batch of intermediate representations of image frames\n across all timesteps, of shape [..., batch_size, timesteps,\n hidden_size].\n samples: Number of samples to draw from the latent distribution.\n\n Returns:\n A tuple of a sample tensor of shape [samples, batch_size,\n latent_size], and a MultivariateNormalDiag distribution from which\n the tensor was sampled, with event shape [latent_size], and batch\n shape [..., batch_size].\n \"\"\"\n dist = self.static_encoder(inputs)\n sample = dist.sample(samples)\n return sample, dist"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef sample_dynamic_prior(self, samples, batch_size, length, fixed=False):\n if fixed:\n sample_batch_size = 1\n else:\n sample_batch_size = batch_size\n\n sample, state = self.dynamic_prior.zero_state([samples, sample_batch_size])\n locs = []\n scale_diags = []\n sample_list = []\n for _ in range(length):\n dist, state = self.dynamic_prior(sample, state)\n sample = dist.sample()\n locs.append(dist.parameters[\"loc\"])\n 
scale_diags.append(dist.parameters[\"scale_diag\"])\n      sample_list.append(sample)\n\n    sample = tf.stack(sample_list, axis=2)\n    loc = tf.stack(locs, axis=2)\n    scale_diag = tf.stack(scale_diags, axis=2)\n\n    if fixed:  # tile along the batch axis\n      sample = sample + tf.zeros([batch_size, 1, 1])\n\n    return sample, tfd.MultivariateNormalDiag(loc=loc, scale_diag=scale_diag)", "response": "Sample the dynamic prior."} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\nsamples the dynamic latent posterior from the given inputs.", "response": "def sample_dynamic_posterior(self, inputs, samples, static_sample=None):\n    \"\"\"Sample the dynamic latent posterior.\n\n    Args:\n      inputs: A batch of intermediate representations of image frames\n        across all timesteps, of shape [..., batch_size, timesteps,\n        hidden_size].\n      samples: Number of samples to draw from the latent distribution.\n      static_sample: A tensor sample of the static latent variable `f`\n        of shape [..., batch_size, latent_size].
Only used\n        for the full dynamic posterior formulation.\n\n    Returns:\n      A tuple of a sample tensor of shape [samples, batch_size, length,\n      latent_size], and a MultivariateNormalDiag distribution from which\n      the tensor was sampled, with event shape [latent_size], and batch\n      shape [broadcasted_shape, batch_size, length], where\n      `broadcasted_shape` is the broadcasted sampled shape between the\n      inputs and static sample.\n\n    Raises:\n      ValueError: If the \"full\" latent posterior formulation is being\n        used, yet a static latent sample was not provided.\n    \"\"\"\n    if self.latent_posterior == \"factorized\":\n      dist = self.dynamic_encoder(inputs)\n      samples = dist.sample(samples)  # (s, N, T, lat)\n    else:  # full\n      if static_sample is None:\n        raise ValueError(\n            \"The full dynamic posterior requires a static latent sample\")\n      dist = self.dynamic_encoder((inputs, static_sample))\n      samples = dist.sample()  # (samples, N, latent)\n    return samples, dist"} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef _canonicalize_param_vals_as_map(self, param_vals):\n    if hasattr(param_vals, 'keys'):\n      param_map = param_vals\n    else:\n      param_map = {p.name: v for (p, v) in zip(self.parameters, param_vals)}\n\n    return param_map", "response": "This method takes a list of parameter values and converts them to a canonical dict that maps from the names given in self.
parameters to the specified parameter values."} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\nsamples from the joint prior over model parameters and trajectories.", "response": "def prior_sample(self,\n num_timesteps,\n initial_step=0,\n params_sample_shape=(),\n trajectories_sample_shape=(),\n seed=None):\n \"\"\"Sample from the joint prior over model parameters and trajectories.\n\n Args:\n num_timesteps: Scalar `int` `Tensor` number of timesteps to model.\n initial_step: Optional scalar `int` `Tensor` specifying the starting\n timestep.\n Default value: 0.\n params_sample_shape: Number of possible worlds to sample iid from the\n parameter prior, or more generally, `Tensor` `int` shape to fill with\n iid samples.\n Default value: [] (i.e., draw a single sample and don't expand the\n shape).\n trajectories_sample_shape: For each sampled set of parameters, number\n of trajectories to sample, or more generally, `Tensor` `int` shape to\n fill with iid samples.\n Default value: [] (i.e., draw a single sample and don't expand the\n shape).\n seed: Python `int` random seed.\n\n Returns:\n trajectories: `float` `Tensor` of shape\n `trajectories_sample_shape + params_sample_shape + [num_timesteps, 1]`\n containing all sampled trajectories.\n param_samples: list of sampled parameter value `Tensor`s, in order\n corresponding to `self.parameters`, each of shape\n `params_sample_shape + prior.batch_shape + prior.event_shape`.\n \"\"\"\n\n seed = distributions.SeedStream(\n seed, salt='StructuralTimeSeries_prior_sample')\n\n with tf.compat.v1.name_scope(\n 'prior_sample',\n values=[num_timesteps, params_sample_shape, trajectories_sample_shape]):\n param_samples = [\n p.prior.sample(params_sample_shape, seed=seed(), name=p.name)\n for p in self.parameters\n ]\n model = self.make_state_space_model(\n num_timesteps=num_timesteps,\n initial_step=initial_step,\n param_vals=param_samples)\n return 
model.sample(trajectories_sample_shape, seed=seed()), param_samples"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef joint_log_prob(self, observed_time_series):\n\n with tf.compat.v1.name_scope(\n 'joint_log_prob', values=[observed_time_series]):\n [\n observed_time_series,\n mask\n ] = sts_util.canonicalize_observed_time_series_with_mask(\n observed_time_series)\n\n num_timesteps = distribution_util.prefer_static_value(\n tf.shape(input=observed_time_series))[-2]\n\n def log_joint_fn(*param_vals):\n \"\"\"Generated log-density function.\"\"\"\n\n # Sum the log_prob values from parameter priors.\n param_lp = sum([\n param.prior.log_prob(param_val)\n for (param, param_val) in zip(self.parameters, param_vals)\n ])\n\n # Build a linear Gaussian state space model and evaluate the marginal\n # log_prob on observations.\n lgssm = self.make_state_space_model(\n param_vals=param_vals, num_timesteps=num_timesteps)\n observation_lp = lgssm.log_prob(observed_time_series, mask=mask)\n\n # Sum over likelihoods from iid observations. Without this sum,\n # adding `param_lp + observation_lp` would broadcast the param priors\n # over the sample shape, which incorrectly multi-counts the param\n # priors.\n sample_ndims = tf.maximum(0,\n tf.rank(observation_lp) - tf.rank(param_lp))\n observation_lp = tf.reduce_sum(\n input_tensor=observation_lp, axis=tf.range(sample_ndims))\n\n return param_lp + observation_lp\n\n return log_joint_fn", "response": "Builds the joint log - probability function for the given model and time series."} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef _compute_min_event_ndims(bijector_list, compute_forward=True):\n min_event_ndims = 0\n # This is a mouthful, but what this encapsulates is that if not for rank\n # changing bijectors, we'd only need to compute the largest of the min\n # required ndims. Hence \"max_min\". 
Due to rank changing bijectors, we need to\n # account for synthetic rank growth / synthetic rank decrease from a rank\n # changing bijector.\n rank_changed_adjusted_max_min_event_ndims = 0\n\n if compute_forward:\n bijector_list = reversed(bijector_list)\n\n for b in bijector_list:\n if compute_forward:\n current_min_event_ndims = b.forward_min_event_ndims\n current_inverse_min_event_ndims = b.inverse_min_event_ndims\n else:\n current_min_event_ndims = b.inverse_min_event_ndims\n current_inverse_min_event_ndims = b.forward_min_event_ndims\n\n # New dimensions were touched.\n if rank_changed_adjusted_max_min_event_ndims < current_min_event_ndims:\n min_event_ndims += (\n current_min_event_ndims - rank_changed_adjusted_max_min_event_ndims)\n rank_changed_adjusted_max_min_event_ndims = max(\n current_min_event_ndims, rank_changed_adjusted_max_min_event_ndims)\n\n # If the number of dimensions has increased via forward, then\n # inverse_min_event_ndims > forward_min_event_ndims, and hence the\n # dimensions we computed on, have moved left (so we have operated\n # on additional dimensions).\n # Conversely, if the number of dimensions has decreased via forward,\n # then we have inverse_min_event_ndims < forward_min_event_ndims,\n # and so we will have operated on fewer right most dimensions.\n\n number_of_changed_dimensions = (\n current_min_event_ndims - current_inverse_min_event_ndims)\n rank_changed_adjusted_max_min_event_ndims -= number_of_changed_dimensions\n return min_event_ndims", "response": "This function computes the minimum event ndims for a given list of bijectors."} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\nconverting a vector size to a matrix size.", "response": "def vector_size_to_square_matrix_size(d, validate_args, name=None):\n \"\"\"Convert a vector size to a matrix size.\"\"\"\n if isinstance(d, (float, int, np.generic, np.ndarray)):\n n = (-1 + np.sqrt(1 + 8 * d)) / 2.\n if float(int(n)) != n:\n raise 
ValueError(\"Vector length is not a triangular number.\")\n return int(n)\n else:\n with tf.name_scope(name or \"vector_size_to_square_matrix_size\") as name:\n n = (-1. + tf.sqrt(1 + 8. * tf.cast(d, dtype=tf.float32))) / 2.\n if validate_args:\n with tf.control_dependencies([\n assert_util.assert_equal(\n tf.cast(tf.cast(n, dtype=tf.int32), dtype=tf.float32),\n n,\n message=\"Vector length is not a triangular number\")\n ]):\n n = tf.identity(n)\n return tf.cast(n, d.dtype)"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef _kl_gumbel_gumbel(a, b, name=None):\n with tf.name_scope(name or \"kl_gumbel_gumbel\"):\n # Consistent with\n # http://www.mast.queensu.ca/~communications/Papers/gil-msc11.pdf, page 64\n # The paper uses beta to refer to scale and mu to refer to loc.\n # There is actually an error in the solution as printed; this is based on\n # the second-to-last step of the derivation. The value as printed would be\n # off by (a.loc - b.loc) / b.scale.\n return (tf.math.log(b.scale) - tf.math.log(a.scale) + np.euler_gamma *\n (a.scale / b.scale - 1.) +\n tf.math.expm1((b.loc - a.loc) / b.scale +\n tf.math.lgamma(a.scale / b.scale + 1.)) +\n (a.loc - b.loc) / b.scale)", "response": "Calculate the batched KL divergence of a Gumbel distribution with a and b Gumbel distribution."} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef _ndtr(x):\n half_sqrt_2 = tf.constant(\n 0.5 * np.sqrt(2.), dtype=x.dtype, name=\"half_sqrt_2\")\n w = x * half_sqrt_2\n z = tf.abs(w)\n y = tf.where(\n tf.less(z, half_sqrt_2), 1. + tf.math.erf(w),\n tf.where(tf.greater(w, 0.), 2. 
- tf.math.erfc(z), tf.math.erfc(z)))\n return 0.5 * y", "response": "Implements ndtr core logic."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function for\ncomputing the inverse of the CDF of the Normal distribution function.", "response": "def ndtri(p, name=\"ndtri\"):\n \"\"\"The inverse of the CDF of the Normal distribution function.\n\n Returns x such that the area under the pdf from minus infinity to x is equal\n to p.\n\n A piece-wise rational approximation is done for the function.\n This is a port of the implementation in netlib.\n\n Args:\n p: `Tensor` of type `float32`, `float64`.\n name: Python string. A name for the operation (default=\"ndtri\").\n\n Returns:\n x: `Tensor` with `dtype=p.dtype`.\n\n Raises:\n TypeError: if `p` is not floating-type.\n \"\"\"\n\n with tf.name_scope(name):\n p = tf.convert_to_tensor(value=p, name=\"p\")\n if dtype_util.as_numpy_dtype(p.dtype) not in [np.float32, np.float64]:\n raise TypeError(\n \"p.dtype=%s is not handled, see docstring for supported types.\"\n % p.dtype)\n return _ndtri(p)"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _ndtri(p):\n\n # Constants used in piece-wise rational approximations. 
Taken from the cephes\n # library:\n # https://root.cern.ch/doc/v608/SpecFuncCephesInv_8cxx_source.html\n p0 = list(reversed([-5.99633501014107895267E1,\n 9.80010754185999661536E1,\n -5.66762857469070293439E1,\n 1.39312609387279679503E1,\n -1.23916583867381258016E0]))\n q0 = list(reversed([1.0,\n 1.95448858338141759834E0,\n 4.67627912898881538453E0,\n 8.63602421390890590575E1,\n -2.25462687854119370527E2,\n 2.00260212380060660359E2,\n -8.20372256168333339912E1,\n 1.59056225126211695515E1,\n -1.18331621121330003142E0]))\n p1 = list(reversed([4.05544892305962419923E0,\n 3.15251094599893866154E1,\n 5.71628192246421288162E1,\n 4.40805073893200834700E1,\n 1.46849561928858024014E1,\n 2.18663306850790267539E0,\n -1.40256079171354495875E-1,\n -3.50424626827848203418E-2,\n -8.57456785154685413611E-4]))\n q1 = list(reversed([1.0,\n 1.57799883256466749731E1,\n 4.53907635128879210584E1,\n 4.13172038254672030440E1,\n 1.50425385692907503408E1,\n 2.50464946208309415979E0,\n -1.42182922854787788574E-1,\n -3.80806407691578277194E-2,\n -9.33259480895457427372E-4]))\n p2 = list(reversed([3.23774891776946035970E0,\n 6.91522889068984211695E0,\n 3.93881025292474443415E0,\n 1.33303460815807542389E0,\n 2.01485389549179081538E-1,\n 1.23716634817820021358E-2,\n 3.01581553508235416007E-4,\n 2.65806974686737550832E-6,\n 6.23974539184983293730E-9]))\n q2 = list(reversed([1.0,\n 6.02427039364742014255E0,\n 3.67983563856160859403E0,\n 1.37702099489081330271E0,\n 2.16236993594496635890E-1,\n 1.34204006088543189037E-2,\n 3.28014464682127739104E-4,\n 2.89247864745380683936E-6,\n 6.79019408009981274425E-9]))\n\n def _create_polynomial(var, coeffs):\n \"\"\"Compute n_th order polynomial via Horner's method.\"\"\"\n coeffs = np.array(coeffs, dtype_util.as_numpy_dtype(var.dtype))\n if not coeffs.size:\n return tf.zeros_like(var)\n return coeffs[0] + _create_polynomial(var, coeffs[1:]) * var\n\n maybe_complement_p = tf.where(p > -np.expm1(-2.), 1. 
- p, p)\n # Write in an arbitrary value in place of 0 for p since 0 will cause NaNs\n # later on. The result from the computation when p == 0 is not used so any\n # number that doesn't result in NaNs is fine.\n sanitized_mcp = tf.where(\n maybe_complement_p <= 0.,\n tf.fill(tf.shape(input=p),\n dtype_util.as_numpy_dtype(p.dtype)(0.5)), maybe_complement_p)\n\n # Compute x for p > exp(-2): x/sqrt(2pi) = w + w**3 P0(w**2)/Q0(w**2).\n w = sanitized_mcp - 0.5\n ww = w ** 2\n x_for_big_p = w + w * ww * (_create_polynomial(ww, p0)\n / _create_polynomial(ww, q0))\n x_for_big_p *= -np.sqrt(2. * np.pi)\n\n # Compute x for p <= exp(-2): x = z - log(z)/z - (1/z) P(1/z) / Q(1/z),\n # where z = sqrt(-2. * log(p)), and P/Q are chosen between two different\n # arrays based on whether p < exp(-32).\n z = tf.sqrt(-2. * tf.math.log(sanitized_mcp))\n first_term = z - tf.math.log(z) / z\n second_term_small_p = (\n _create_polynomial(1. / z, p2) /\n _create_polynomial(1. / z, q2) / z)\n second_term_otherwise = (\n _create_polynomial(1. / z, p1) /\n _create_polynomial(1. / z, q1) / z)\n x_for_small_p = first_term - second_term_small_p\n x_otherwise = first_term - second_term_otherwise\n\n x = tf.where(sanitized_mcp > np.exp(-2.), x_for_big_p,\n tf.where(z >= 8.0, x_for_small_p, x_otherwise))\n\n x = tf.where(p > 1. - np.exp(-2.), x, -x)\n infinity_scalar = tf.constant(np.inf, dtype=p.dtype)\n infinity = tf.fill(tf.shape(input=p), infinity_scalar)\n x_nan_replaced = tf.where(\n p <= 0.0, -infinity, tf.where(p >= 1.0, infinity, x))\n return x_nan_replaced", "response": "Implements ndtri core logic."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\nlog Normal distribution function.", "response": "def log_ndtr(x, series_order=3, name=\"log_ndtr\"):\n \"\"\"Log Normal distribution function.\n\n For details of the Normal distribution function see `ndtr`.\n\n This function calculates `(log o ndtr)(x)` by either calling `log(ndtr(x))` or\n using an asymptotic series. 
Specifically:\n - For `x > upper_segment`, use the approximation `-ndtr(-x)` based on\n `log(1-x) ~= -x, x << 1`.\n - For `lower_segment < x <= upper_segment`, use the existing `ndtr` technique\n and take a log.\n - For `x <= lower_segment`, we use the series approximation of erf to compute\n the log CDF directly.\n\n The `lower_segment` is set based on the precision of the input:\n\n ```\n lower_segment = { -20, x.dtype=float64\n { -10, x.dtype=float32\n upper_segment = { 8, x.dtype=float64\n { 5, x.dtype=float32\n ```\n\n When `x < lower_segment`, the `ndtr` asymptotic series approximation is:\n\n ```\n ndtr(x) = scale * (1 + sum) + R_N\n scale = exp(-0.5 x**2) / (-x sqrt(2 pi))\n sum = Sum{(-1)^n (2n-1)!! / (x**2)^n, n=1:N}\n R_N = O(exp(-0.5 x**2) (2N+1)!! / |x|^{2N+3})\n ```\n\n where `(2n-1)!! = (2n-1) (2n-3) (2n-5) ... (3) (1)` is a\n [double-factorial](https://en.wikipedia.org/wiki/Double_factorial).\n\n\n Args:\n x: `Tensor` of type `float32`, `float64`.\n series_order: Positive Python `integer`. Maximum depth to\n evaluate the asymptotic expansion. This is the `N` above.\n name: Python string. 
A name for the operation (default=\"log_ndtr\").\n\n Returns:\n log_ndtr: `Tensor` with `dtype=x.dtype`.\n\n Raises:\n TypeError: if `x.dtype` is not handled.\n TypeError: if `series_order` is not a Python `integer`.\n ValueError: if `series_order` is not in `[0, 30]`.\n \"\"\"\n if not isinstance(series_order, int):\n raise TypeError(\"series_order must be a Python integer.\")\n if series_order < 0:\n raise ValueError(\"series_order must be non-negative.\")\n if series_order > 30:\n raise ValueError(\"series_order must be <= 30.\")\n\n with tf.name_scope(name):\n x = tf.convert_to_tensor(value=x, name=\"x\")\n\n if dtype_util.base_equal(x.dtype, tf.float64):\n lower_segment = LOGNDTR_FLOAT64_LOWER\n upper_segment = LOGNDTR_FLOAT64_UPPER\n elif dtype_util.base_equal(x.dtype, tf.float32):\n lower_segment = LOGNDTR_FLOAT32_LOWER\n upper_segment = LOGNDTR_FLOAT32_UPPER\n else:\n raise TypeError(\"x.dtype=%s is not supported.\" % x.dtype)\n\n # The basic idea here was ported from:\n # https://root.cern.ch/doc/v608/SpecFuncCephesInv_8cxx_source.html\n # We copy the main idea, with a few changes\n # * For x >> 1, and X ~ Normal(0, 1),\n # Log[P[X < x]] = Log[1 - P[X < -x]] approx -P[X < -x],\n # which extends the range of validity of this function.\n # * We use one fixed series_order for all of 'x', rather than adaptive.\n # * Our docstring properly reflects that this is an asymptotic series, not a\n # Taylor series. We also provided a correct bound on the remainder.\n # * We need to use the max/min in the _log_ndtr_lower arg to avoid nan when\n # x=0. This happens even though the branch is unchosen because when x=0\n # the gradient of a select involves the calculation 1*dy+0*(-inf)=nan\n # regardless of whether dy is finite. 
Note that the minimum is a NOP if\n # the branch is chosen.\n return tf.where(\n tf.greater(x, upper_segment),\n -_ndtr(-x), # log(1-x) ~= -x, x << 1\n tf.where(\n tf.greater(x, lower_segment),\n tf.math.log(_ndtr(tf.maximum(x, lower_segment))),\n _log_ndtr_lower(tf.minimum(x, lower_segment), series_order)))"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef _log_ndtr_lower(x, series_order):\n x_2 = tf.square(x)\n # Log of the term multiplying (1 + sum)\n log_scale = -0.5 * x_2 - tf.math.log(-x) - 0.5 * np.log(2. * np.pi)\n return log_scale + tf.math.log(_log_ndtr_asymptotic_series(x, series_order))", "response": "Asymptotic expansion version of log[cdf(x)], appropriate for x << -1."} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\ncalculates the asymptotic series used in log_ndtr.", "response": "def _log_ndtr_asymptotic_series(x, series_order):\n \"\"\"Calculates the asymptotic series used in log_ndtr.\"\"\"\n npdt = dtype_util.as_numpy_dtype(x.dtype)\n if series_order <= 0:\n return npdt(1)\n x_2 = tf.square(x)\n even_sum = tf.zeros_like(x)\n odd_sum = tf.zeros_like(x)\n x_2n = x_2 # Start with x^{2*1} = x^{2*n} with n = 1.\n for n in range(1, series_order + 1):\n y = npdt(_double_factorial(2 * n - 1)) / x_2n\n if n % 2:\n odd_sum += y\n else:\n even_sum += y\n x_2n *= x_2\n return 1. + even_sum - odd_sum"} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\ncomputes the inverse of erf, the error function.", "response": "def erfinv(x, name=\"erfinv\"):\n \"\"\"The inverse function for erf, the error function.\n\n Args:\n x: `Tensor` of type `float32`, `float64`.\n name: Python string. 
A name for the operation (default=\"erfinv\").\n\n Returns:\n x: `Tensor` with `dtype=x.dtype`.\n\n Raises:\n TypeError: if `x` is not floating-type.\n \"\"\"\n\n with tf.name_scope(name):\n x = tf.convert_to_tensor(value=x, name=\"x\")\n if dtype_util.as_numpy_dtype(x.dtype) not in [np.float32, np.float64]:\n raise TypeError(\"x.dtype={} is not handled, see docstring for supported \"\n \"types.\".format(dtype_util.name(x.dtype)))\n return ndtri((x + 1.) / 2.) / np.sqrt(2.)"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef log_cdf_laplace(x, name=\"log_cdf_laplace\"):\n\n with tf.name_scope(name):\n x = tf.convert_to_tensor(value=x, name=\"x\")\n\n # For x < 0, L(x) = 0.5 * exp{x} exactly, so Log[L(x)] = log(0.5) + x.\n lower_solution = -np.log(2.) + x\n\n # safe_exp_neg_x = exp{-x} for x > 0, but is\n # bounded above by 1, which avoids\n # log[1 - 1] = -inf for x = log(1/2), AND\n # exp{-x} --> inf, for x << -1\n safe_exp_neg_x = tf.exp(-tf.abs(x))\n\n # log1p(z) = log(1 + z) approx z for |z| << 1. This approximation is used\n # internally by log1p, rather than being done explicitly here.\n upper_solution = tf.math.log1p(-0.5 * safe_exp_neg_x)\n\n return tf.where(x < 0., lower_solution, upper_solution)", "response": "Log Laplace distribution function."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef text_messages_joint_log_prob(count_data, lambda_1, lambda_2, tau):\n alpha = (1. 
/ tf.reduce_mean(input_tensor=count_data))\n rv_lambda = tfd.Exponential(rate=alpha)\n\n rv_tau = tfd.Uniform()\n\n lambda_ = tf.gather(\n [lambda_1, lambda_2],\n indices=tf.cast(\n tau * tf.cast(tf.size(input=count_data), dtype=tf.float32) <= tf.cast(\n tf.range(tf.size(input=count_data)), dtype=tf.float32),\n dtype=tf.int32))\n rv_observation = tfd.Poisson(rate=lambda_)\n\n return (rv_lambda.log_prob(lambda_1) + rv_lambda.log_prob(lambda_2) +\n rv_tau.log_prob(tau) +\n tf.reduce_sum(input_tensor=rv_observation.log_prob(count_data)))", "response": "Joint log probability function."} {"SOURCE": "codesearchnet", "instruction": "Can you create a Python 3 function that\nruns HMC on the text-messages unnormalized posterior.", "response": "def benchmark_text_messages_hmc(\n num_results=int(3e3),\n num_burnin_steps=int(3e3),\n num_leapfrog_steps=3):\n \"\"\"Runs HMC on the text-messages unnormalized posterior.\"\"\"\n\n if not tf.executing_eagerly():\n tf.compat.v1.reset_default_graph()\n\n # Build a static, pretend dataset.\n count_data = tf.cast(\n tf.concat(\n [tfd.Poisson(rate=15.).sample(43),\n tfd.Poisson(rate=25.).sample(31)],\n axis=0),\n dtype=tf.float32)\n if tf.executing_eagerly():\n count_data = count_data.numpy()\n else:\n with tf.compat.v1.Session():\n count_data = count_data.eval()\n\n # Define a closure over our joint_log_prob.\n def unnormalized_log_posterior(lambda1, lambda2, tau):\n return text_messages_joint_log_prob(count_data, lambda1, lambda2, tau)\n\n if tf.executing_eagerly():\n sample_chain = tf.function(tfp.mcmc.sample_chain)\n else:\n sample_chain = tfp.mcmc.sample_chain\n\n # Initialize the step_size. 
(It will be automatically adapted.)\n step_size = tf.compat.v2.Variable(\n name='step_size',\n initial_value=tf.constant(0.05, dtype=tf.float32),\n trainable=False)\n\n def computation():\n \"\"\"The benchmark computation.\"\"\"\n\n initial_chain_state = [\n tf.constant(count_data.mean(), name='init_lambda1'),\n tf.constant(count_data.mean(), name='init_lambda2'),\n tf.constant(0.5, name='init_tau'),\n ]\n\n unconstraining_bijectors = [\n tfp.bijectors.Exp(), # Maps a positive real to R.\n tfp.bijectors.Exp(), # Maps a positive real to R.\n tfp.bijectors.Sigmoid(), # Maps [0,1] to R.\n ]\n\n _, kernel_results = sample_chain(\n num_results=num_results,\n num_burnin_steps=num_burnin_steps,\n current_state=initial_chain_state,\n kernel=tfp.mcmc.TransformedTransitionKernel(\n inner_kernel=tfp.mcmc.HamiltonianMonteCarlo(\n target_log_prob_fn=unnormalized_log_posterior,\n num_leapfrog_steps=num_leapfrog_steps,\n step_size=step_size,\n step_size_update_fn=\n tfp.mcmc.make_simple_step_size_update_policy(num_burnin_steps),\n state_gradients_are_stopped=True),\n bijector=unconstraining_bijectors))\n\n return kernel_results.inner_results.is_accepted\n\n # Let's force evaluation of graph to ensure build time is not part of our time\n # trial.\n is_accepted_tensor = computation()\n if not tf.executing_eagerly():\n session = tf.compat.v1.Session()\n session.run(tf.compat.v1.global_variables_initializer())\n session.run(is_accepted_tensor)\n\n start_time = time.time()\n if tf.executing_eagerly():\n is_accepted = computation()\n else:\n is_accepted = session.run(is_accepted_tensor)\n wall_time = time.time() - start_time\n\n num_accepted = np.sum(is_accepted)\n acceptance_rate = np.float32(num_accepted) / np.float32(num_results)\n\n return dict(\n iters=(num_results + num_burnin_steps) * num_leapfrog_steps,\n extras={'acceptance_rate': acceptance_rate},\n wall_time=wall_time)"} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\nreturn True if 
the given index_points would yield a univariate marginal.", "response": "def _is_univariate_marginal(self, index_points):\n \"\"\"True if the given index_points would yield a univariate marginal.\n\n Args:\n index_points: the set of index set locations at which to compute the\n marginal Gaussian distribution. If this set is of size 1, the marginal is\n univariate.\n\n Returns:\n is_univariate: Boolean indicating whether the marginal is univariate or\n multivariate. In the case of dynamic shape in the number of index points,\n defaults to \"multivariate\" since that's the best we can do.\n \"\"\"\n num_index_points = tf.compat.dimension_value(\n index_points.shape[-(self.kernel.feature_ndims + 1)])\n if num_index_points is None:\n warnings.warn(\n 'Unable to detect statically whether the number of index_points is '\n '1. As a result, defaulting to treating the marginal GP at '\n '`index_points` as a multivariate Gaussian. This makes some methods, '\n 'like `cdf` unavailable.')\n return num_index_points == 1"} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\ncompute the marginal distribution over function values at index_points.", "response": "def get_marginal_distribution(self, index_points=None):\n \"\"\"Compute the marginal of this GP over function values at `index_points`.\n\n Args:\n index_points: `float` `Tensor` representing finite (batch of) vector(s) of\n points in the index set over which the GP is defined. Shape has the form\n `[b1, ..., bB, e, f1, ..., fF]` where `F` is the number of feature\n dimensions and must equal `kernel.feature_ndims` and `e` is the number\n (size) of index points in each batch. Ultimately this distribution\n corresponds to a `e`-dimensional multivariate normal. 
The batch shape\n must be broadcastable with `kernel.batch_shape` and any batch dims\n yielded by `mean_fn`.\n\n Returns:\n marginal: a `Normal` or `MultivariateNormalLinearOperator` distribution,\n according to whether `index_points` consists of one or many index\n points, respectively.\n \"\"\"\n with self._name_scope('get_marginal_distribution'):\n # TODO(cgs): consider caching the result here, keyed on `index_points`.\n index_points = self._get_index_points(index_points)\n covariance = self._compute_covariance(index_points)\n loc = self._mean_fn(index_points)\n # If we're sure the number of index points is 1, we can just construct a\n # scalar Normal. This has computational benefits and supports things like\n # CDF that aren't otherwise straightforward to provide.\n if self._is_univariate_marginal(index_points):\n scale = tf.sqrt(covariance)\n # `loc` has a trailing 1 in the shape; squeeze it.\n loc = tf.squeeze(loc, axis=-1)\n return normal.Normal(\n loc=loc,\n scale=scale,\n validate_args=self._validate_args,\n allow_nan_stats=self._allow_nan_stats,\n name='marginal_distribution')\n else:\n scale = tf.linalg.LinearOperatorLowerTriangular(\n tf.linalg.cholesky(_add_diagonal_shift(covariance, self.jitter)),\n is_non_singular=True,\n name='GaussianProcessScaleLinearOperator')\n return mvn_linear_operator.MultivariateNormalLinearOperator(\n loc=loc,\n scale=scale,\n validate_args=self._validate_args,\n allow_nan_stats=self._allow_nan_stats,\n name='marginal_distribution')"} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\nreturns the index_points argument if given otherwise the class member .", "response": "def _get_index_points(self, index_points=None):\n \"\"\"Return `index_points` if not None, else `self._index_points`.\n\n Args:\n index_points: if given, this is what is returned; else,\n `self._index_points`\n\n Returns:\n index_points: the given arg, if not None, else the class member\n `self._index_points`.\n\n 
Raises:\n ValueError: if `index_points` and `self._index_points` are both `None`.\n \"\"\"\n if self._index_points is None and index_points is None:\n raise ValueError(\n 'This GaussianProcess instance was not instantiated with a value for '\n 'index_points. One must therefore be provided when calling sample, '\n 'log_prob, and other such methods. In particular, one can\'t compute '\n 'KL divergences to/from an instance of `GaussianProcess` with '\n 'unspecified `index_points` directly. Instead, use the '\n '`get_marginal_distribution` function, which takes `index_points` as '\n 'an argument and returns a `Normal` or '\n '`MultivariateNormalLinearOperator` instance, whose KL can be '\n 'computed.')\n return index_points if index_points is not None else self._index_points"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _logsum_expbig_minus_expsmall(big, small):\n with tf.name_scope(\"logsum_expbig_minus_expsmall\"):\n return tf.math.log1p(-tf.exp(small - big)) + big", "response": "Stable evaluation of log[exp{big} - exp{small}]."} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\ncomputing log_prob using log survival_function and cdf together.", "response": "def _log_prob_with_logsf_and_logcdf(self, y):\n \"\"\"Compute log_prob(y) using log survival_function and cdf together.\"\"\"\n # There are two options that would be equal if we had infinite precision:\n # Log[ sf(y - 1) - sf(y) ]\n # = Log[ exp{logsf(y - 1)} - exp{logsf(y)} ]\n # Log[ cdf(y) - cdf(y - 1) ]\n # = Log[ exp{logcdf(y)} - exp{logcdf(y - 1)} ]\n logsf_y = self.log_survival_function(y)\n logsf_y_minus_1 = self.log_survival_function(y - 1)\n logcdf_y = self.log_cdf(y)\n logcdf_y_minus_1 = self.log_cdf(y - 1)\n\n # Important: Here we use select in a way such that no input is inf, this\n # prevents the troublesome case where the output of select can be finite,\n # but the output of grad(select) will be NaN.\n\n 
# In either case, we are doing Log[ exp{big} - exp{small} ]\n # We want to use the sf items precisely when we are on the right side of the\n # median, which occurs when logsf_y < logcdf_y.\n big = tf.where(logsf_y < logcdf_y, logsf_y_minus_1, logcdf_y)\n small = tf.where(logsf_y < logcdf_y, logsf_y, logcdf_y_minus_1)\n\n return _logsum_expbig_minus_expsmall(big, small)"} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\ncreate an IAF bijector.", "response": "def make_iaf_stack(total_event_size,\n num_hidden_layers=2,\n seed=None,\n dtype=tf.float32):\n \"\"\"Creates a stacked IAF bijector.\n\n This bijector operates on vector-valued events.\n\n Args:\n total_event_size: Number of dimensions to operate over.\n num_hidden_layers: How many hidden layers to use in each IAF.\n seed: Random seed for the initializers.\n dtype: DType for the variables.\n\n Returns:\n bijector: The created bijector.\n \"\"\"\n\n seed = tfd.SeedStream(seed, 'make_iaf_stack')\n\n def make_iaf():\n \"\"\"Create an IAF.\"\"\"\n initializer = tf.compat.v2.keras.initializers.VarianceScaling(\n 2 * 0.01, seed=seed() % (2**31 - 1))\n\n made = tfb.AutoregressiveLayer(\n params=2,\n event_shape=[total_event_size],\n hidden_units=[total_event_size] * num_hidden_layers,\n activation=tf.nn.elu,\n kernel_initializer=initializer,\n dtype=dtype)\n\n def shift_and_scale(x):\n # TODO(siege): Something is losing the static shape.\n x.set_shape(\n x.shape.merge_with([None] * (x.shape.ndims - 1) + [total_event_size]))\n return tf.unstack(made(x), num=2, axis=-1)\n\n return tfb.Invert(tfb.MaskedAutoregressiveFlow(shift_and_scale))\n\n def make_swap():\n \"\"\"Create a swap.\"\"\"\n permutation = list(reversed(range(total_event_size)))\n return tfb.Permute(permutation)\n\n bijector = make_iaf()\n bijector = make_swap()(bijector)\n bijector = make_iaf()(bijector)\n bijector = make_swap()(bijector)\n bijector = make_iaf()(bijector)\n bijector = make_swap()(bijector)\n\n 
return bijector"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\nrun one iteration of NeuTra.", "response": "def one_step(self, current_state, previous_kernel_results):\n \"\"\"Runs one iteration of NeuTra.\n\n Args:\n current_state: `Tensor` or Python `list` of `Tensor`s representing the\n current state(s) of the Markov chain(s). The first `r` dimensions index\n independent chains, `r = tf.rank(target_log_prob_fn(*current_state))`.\n previous_kernel_results: `collections.namedtuple` containing `Tensor`s\n representing values from previous calls to this function (or from the\n `bootstrap_results` function.)\n\n Returns:\n next_state: Tensor or Python list of `Tensor`s representing the state(s)\n of the Markov chain(s) after taking exactly one step. Has same type and\n shape as `current_state`.\n kernel_results: `collections.namedtuple` of internal calculations used to\n advance the chain.\n \"\"\"\n\n @tfp.mcmc.internal.util.make_innermost_setter\n def set_num_leapfrog_steps(kernel_results, num_leapfrog_steps):\n return kernel_results._replace(\n accepted_results=kernel_results.accepted_results._replace(\n num_leapfrog_steps=num_leapfrog_steps))\n\n step_size = previous_kernel_results.new_step_size\n previous_kernel_results = set_num_leapfrog_steps(\n previous_kernel_results, self._num_leapfrog_steps(step_size))\n\n new_state, kernel_results = self._kernel.one_step(\n self._flatten_state(current_state), previous_kernel_results)\n return self._unflatten_state(new_state), kernel_results"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\ntrain the bijector and creates initial kernel_results.", "response": "def bootstrap_results(self, state):\n \"\"\"Trains the bijector and creates initial `previous_kernel_results`.\n\n The supplied `state` is only used to determine the number of chains to run\n in parallel_iterations\n\n Args:\n state: `Tensor` or Python `list` of `Tensor`s representing the initial\n state(s) of 
the Markov chain(s). The first `r` dimensions index\n independent chains, `r = tf.rank(target_log_prob_fn(*state))`.\n\n Returns:\n kernel_results: Instance of\n `UncalibratedHamiltonianMonteCarloKernelResults` inside\n `MetropolisHastingsResults` inside `TransformedTransitionKernelResults`\n inside `SimpleStepSizeAdaptationResults`.\n \"\"\"\n\n def loss():\n q = self._flattened_variational_distribution()\n # TODO(siege): How to seed this?\n samples = q.sample(self.train_batch_size)\n return tf.reduce_mean(\n input_tensor=q.log_prob(samples) -\n self._flattened_target_log_prob(samples),\n axis=-1)\n\n lr = tf.convert_to_tensor(value=self.learning_rate, dtype=self._dtype)\n dtype = lr.dtype\n\n learning_rate = tf.compat.v2.optimizers.schedules.PiecewiseConstantDecay(\n list(self.num_train_steps *\n np.array([0.2, 0.8]).astype(dtype.as_numpy_dtype())),\n [lr, lr * 0.1, lr * 0.01])\n\n opt = tf.compat.v2.optimizers.Adam(learning_rate)\n\n @tf.function(autograph=False)\n def train_step():\n with tf.GradientTape() as tape:\n loss_val = loss()\n vals = tape.watched_variables()\n grads = tape.gradient(loss_val, vals)\n grads_and_vals = list(zip(grads, vals))\n opt.apply_gradients(grads_and_vals)\n return loss_val\n\n for step in range(self.num_train_steps):\n loss_val = train_step()\n tf.debugging.assert_all_finite(\n loss_val, 'NeuTra loss is NaN at step {}'.format(step))\n if self.train_debug_fn:\n # pylint: disable=not-callable\n self.train_debug_fn(self, step, loss_val)\n\n state_parts = tf.nest.flatten(state)\n flat_state_shapes = tf.nest.flatten(self.state_shape)\n batch_shape = tf.shape(input=state_parts[0])[:-flat_state_shapes[0].ndims]\n\n return self._kernel.bootstrap_results(\n self._flattened_variational_distribution().sample(\n batch_shape, seed=self.seed))"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef mean_log_det(self, name=\"mean_log_det\"):\n with self._name_scope(name):\n return (self._multi_digamma(0.5 
* self.df, self.dimension) +\n self.dimension * math.log(2.) +\n 2 * self.scale_operator.log_abs_determinant())", "response": "Computes the mean log det of X under this Wishart distribution."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\ncomputing the log normalizing constant, log(Z).", "response": "def log_normalization(self, name=\"log_normalization\"):\n \"\"\"Computes the log normalizing constant, log(Z).\"\"\"\n with self._name_scope(name):\n return (self.df * self.scale_operator.log_abs_determinant() +\n 0.5 * self.df * self.dimension * math.log(2.) +\n self._multi_lgamma(0.5 * self.df, self.dimension))"} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\ncreates a sequence used in multivariate (di)gamma; shape = shape(a) + [p].", "response": "def _multi_gamma_sequence(self, a, p, name=\"multi_gamma_sequence\"):\n \"\"\"Creates sequence used in multivariate (di)gamma; shape = shape(a)+[p].\"\"\"\n with self._name_scope(name):\n # Linspace only takes scalars, so we'll add in the offset afterwards.\n seq = tf.linspace(\n tf.constant(0., dtype=self.dtype), 0.5 - 0.5 * p, tf.cast(\n p, tf.int32))\n return seq + tf.expand_dims(a, [-1])"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _multi_lgamma(self, a, p, name=\"multi_lgamma\"):\n with self._name_scope(name):\n seq = self._multi_gamma_sequence(a, p)\n return (0.25 * p * (p - 1.) 
* math.log(math.pi) +\n tf.reduce_sum(input_tensor=tf.math.lgamma(seq), axis=[-1]))", "response": "Computes the log multivariate gamma function; log(Gamma_p(a))."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _multi_digamma(self, a, p, name=\"multi_digamma\"):\n with self._name_scope(name):\n seq = self._multi_gamma_sequence(a, p)\n return tf.reduce_sum(input_tensor=tf.math.digamma(seq), axis=[-1])", "response": "Computes the multivariate digamma function; Psi_p(a)."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _outer_squared_difference(x, y):\n z = x - y\n return z[..., tf.newaxis, :] * z[..., tf.newaxis]", "response": "Convenience function analogous to tf.squared_difference."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _value_and_batch_jacobian(f, x):\n if tf.executing_eagerly():\n with tf.GradientTape() as tape:\n tape.watch(x)\n value = f(x)\n batch_jacobian = tape.batch_jacobian(value, x)\n else:\n value = f(x)\n batch_jacobian = gradients.batch_jacobian(value, x)\n return value, batch_jacobian", "response": "Enables uniform interface to value and batch jacobian calculation."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\ndisabling computation of the second derivatives for a tensor.", "response": "def _prevent_2nd_derivative(x):\n \"\"\"Disables computation of the second derivatives for a tensor.\n\n NB: you need to apply a non-identity function to the output tensor for the\n exception to be raised.\n\n Arguments:\n x: A tensor.\n\n Returns:\n A tensor with the same value and the same derivative as x, but that raises\n LookupError when trying to compute the second derivatives.\n \"\"\"\n def grad(dy):\n return array_ops.prevent_gradient(\n dy, message=\"Second derivative is not implemented.\")\n\n return tf.identity(x), grad"} {"SOURCE": 
"codesearchnet", "instruction": "Implement a Python 3 function for\nadding reparameterization (pathwise) gradients to samples of the mixture. Implicit reparameterization gradients are dx/dphi = -(d transform(x, phi) / dx)^-1 * d transform(x, phi) / dphi, where transform(x, phi) is distributional transform that removes all parameters from samples x. We implement them by replacing x with -stop_gradient(d transform(x, phi) / dx)^-1 * transform(x, phi)] for the backward pass (gradient computation). The derivative of this quantity w.r.t. phi is then the implicit reparameterization gradient. Note that this replaces the gradients w.r.t. both the mixture distribution parameters and components distributions parameters. Limitations: 1. Fundamental: components must be fully reparameterized. 2. Distributional transform is currently only implemented for factorized components. 3. Distributional transform currently only works for known rank of the batch tensor. Arguments: x: Sample of mixture distribution Returns: Tensor with same value as x, but with reparameterization gradients", "response": "def _reparameterize_sample(self, x):\n \"\"\"Adds reparameterization (pathwise) gradients to samples of the mixture.\n\n Implicit reparameterization gradients are\n dx/dphi = -(d transform(x, phi) / dx)^-1 * d transform(x, phi) / dphi,\n where transform(x, phi) is distributional transform that removes all\n parameters from samples x.\n\n We implement them by replacing x with\n -stop_gradient(d transform(x, phi) / dx)^-1 * transform(x, phi)]\n for the backward pass (gradient computation).\n The derivative of this quantity w.r.t. phi is then the implicit\n reparameterization gradient.\n Note that this replaces the gradients w.r.t. both the mixture\n distribution parameters and components distributions parameters.\n\n Limitations:\n 1. Fundamental: components must be fully reparameterized.\n 2. Distributional transform is currently only implemented for\n factorized components.\n 3. 
Distributional transform currently only works for known rank of the\n batch tensor.\n\n Arguments:\n x: Sample of mixture distribution\n\n Returns:\n Tensor with same value as x, but with reparameterization gradients\n \"\"\"\n # Remove the existing gradients of x wrt parameters of the components.\n x = tf.stop_gradient(x)\n\n x_2d_shape = [-1, self._event_size] # [S*prod(B), prod(E)]\n\n # Perform distributional transform of x in [S, B, E] shape,\n # but have Jacobian of size [S*prod(B), prod(E), prod(E)].\n def reshaped_distributional_transform(x_2d):\n return tf.reshape(\n self._distributional_transform(tf.reshape(x_2d, tf.shape(input=x))),\n x_2d_shape)\n\n # transform_2d: [S*prod(B), prod(E)]\n # jacobian: [S*prod(B), prod(E), prod(E)]\n transform_2d, jacobian = _value_and_batch_jacobian(\n reshaped_distributional_transform, tf.reshape(x, x_2d_shape))\n\n # We only provide the first derivative; the second derivative computed by\n # autodiff would be incorrect, so we raise an error if it is requested.\n transform_2d = _prevent_2nd_derivative(transform_2d)\n\n # Compute [- stop_gradient(jacobian)^-1 * transform] by solving a linear\n # system. 
The Jacobian is lower triangular because the distributional\n # transform for i-th event dimension does not depend on the next\n # dimensions.\n surrogate_x_2d = -tf.linalg.triangular_solve(\n tf.stop_gradient(jacobian), tf.expand_dims(transform_2d, axis=-1),\n lower=True) # [S*prod(B), prod(E), 1]\n surrogate_x = tf.reshape(surrogate_x_2d, tf.shape(input=x))\n\n # Replace gradients of x with gradients of surrogate_x, but keep the value.\n return x + (surrogate_x - tf.stop_gradient(surrogate_x))"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _distributional_transform(self, x):\n\n if tensorshape_util.rank(x.shape) is None:\n # tf.nn.softmax raises an error when applied to inputs of undefined rank.\n raise ValueError(\"Distributional transform does not support inputs of \"\n \"undefined rank.\")\n\n # Obtain factorized components distribution and assert that it's\n # a scalar distribution.\n if isinstance(self._components_distribution, independent.Independent):\n univariate_components = self._components_distribution.distribution\n else:\n univariate_components = self._components_distribution\n\n with tf.control_dependencies([\n assert_util.assert_equal(\n univariate_components.is_scalar_event(),\n True,\n message=\"`univariate_components` must have scalar event\")\n ]):\n x_padded = self._pad_sample_dims(x) # [S, B, 1, E]\n log_prob_x = univariate_components.log_prob(x_padded) # [S, B, k, E]\n cdf_x = univariate_components.cdf(x_padded) # [S, B, k, E]\n\n # log prob_k (x_1, ..., x_i-1)\n cumsum_log_prob_x = tf.reshape(\n tf.math.cumsum(\n # [S*prod(B)*k, prod(E)]\n tf.reshape(log_prob_x, [-1, self._event_size]),\n exclusive=True,\n axis=-1),\n tf.shape(input=log_prob_x)) # [S, B, k, E]\n\n logits_mix_prob = distribution_utils.pad_mixture_dimensions(\n self.mixture_distribution.logits, self, self.mixture_distribution,\n self._event_ndims) # [B, k, 1]\n\n # Logits of the posterior weights: log w_k + log 
prob_k (x_1, ..., x_i-1)\n log_posterior_weights_x = logits_mix_prob + cumsum_log_prob_x\n\n component_axis = tensorshape_util.rank(x.shape) - self._event_ndims\n posterior_weights_x = tf.nn.softmax(log_posterior_weights_x,\n axis=component_axis)\n return tf.reduce_sum(\n input_tensor=posterior_weights_x * cdf_x, axis=component_axis)", "response": "Performs the distributional transform of the mixture samples."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _split_covariance_into_marginals(covariance, block_sizes):\n start_dim = 0\n marginals = []\n for size in block_sizes:\n end_dim = start_dim + size\n marginals.append(covariance[..., start_dim:end_dim, start_dim:end_dim])\n start_dim = end_dim\n return marginals", "response": "Split a covariance matrix into block-diagonal marginals of given sizes."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\ndecompose an observed time series into contributions from each component. This method decomposes a time series according to the posterior representation of a structural time series model. In particular, it: - Computes the posterior marginal mean and covariances over the additive model's latent space. - Decomposes the latent posterior into the marginal blocks for each model component. - Maps the per-component latent posteriors back through each component's observation model, to generate the time series modeled by that component. Args: model: An instance of `tfp.sts.Sum` representing a structural time series model. observed_time_series: `float` `Tensor` of shape `batch_shape + [num_timesteps, 1]` (omitting the trailing unit dimension is also supported when `num_timesteps > 1`), specifying an observed time series. May optionally be an instance of `tfp.sts.MaskedTimeSeries`, which includes a mask `Tensor` to specify timesteps with missing observations. 
parameter_samples: Python `list` of `Tensors` representing posterior samples of model parameters, with shapes `[concat([ [num_posterior_draws], param.prior.batch_shape, param.prior.event_shape]) for param in model.parameters]`. This may optionally also be a map (Python `dict`) of parameter names to `Tensor` values. Returns: component_dists: A `collections.OrderedDict` instance mapping component StructuralTimeSeries instances (elements of `model.components`) to `tfd.Distribution` instances representing the posterior marginal distributions on the process modeled by each component. Each distribution has batch shape matching that of `posterior_means`/`posterior_covs`, and event shape of `[num_timesteps]`. #### Examples Suppose we've built a model and fit it to data: ```python day_of_week = tfp.sts.Seasonal( num_seasons=7, observed_time_series=observed_time_series, name='day_of_week') local_linear_trend = tfp.sts.LocalLinearTrend( observed_time_series=observed_time_series, name='local_linear_trend') model = tfp.sts.Sum(components=[day_of_week, local_linear_trend], observed_time_series=observed_time_series) num_steps_forecast = 50 samples, kernel_results = tfp.sts.fit_with_hmc(model, observed_time_series) ``` To extract the contributions of individual components, pass the time series and sampled parameters into `decompose_by_component`: ```python component_dists = decompose_by_component( model, observed_time_series=observed_time_series, parameter_samples=samples) # Component mean and stddev have shape `[len(observed_time_series)]`. 
day_of_week_effect_mean = component_dists[day_of_week].mean() day_of_week_effect_stddev = component_dists[day_of_week].stddev() ``` Using the component distributions, we can visualize the uncertainty for each component: ``` from matplotlib import pylab as plt num_components = len(component_dists) xs = np.arange(len(observed_time_series)) fig = plt.figure(figsize=(12, 3 * num_components)) for i, (component, component_dist) in enumerate(component_dists.items()): # If in graph mode, replace `.numpy()` with `.eval()` or `sess.run()`. component_mean = component_dist.mean().numpy() component_stddev = component_dist.stddev().numpy() ax = fig.add_subplot(num_components, 1, 1 + i) ax.plot(xs, component_mean, lw=2) ax.fill_between(xs, component_mean - 2 * component_stddev, component_mean + 2 * component_stddev, alpha=0.5) ax.set_title(component.name) ```", "response": "def decompose_by_component(model, observed_time_series, parameter_samples):\n \"\"\"Decompose an observed time series into contributions from each component.\n\n This method decomposes a time series according to the posterior representation\n of a structural time series model. In particular, it:\n - Computes the posterior marginal mean and covariances over the additive\n model's latent space.\n - Decomposes the latent posterior into the marginal blocks for each\n model component.\n - Maps the per-component latent posteriors back through each component's\n observation model, to generate the time series modeled by that component.\n\n Args:\n model: An instance of `tfp.sts.Sum` representing a structural time series\n model.\n observed_time_series: `float` `Tensor` of shape\n `batch_shape + [num_timesteps, 1]` (omitting the trailing unit dimension\n is also supported when `num_timesteps > 1`), specifying an observed time\n series. 
May optionally be an instance of `tfp.sts.MaskedTimeSeries`, which\n includes a mask `Tensor` to specify timesteps with missing observations.\n parameter_samples: Python `list` of `Tensors` representing posterior\n samples of model parameters, with shapes `[concat([\n [num_posterior_draws], param.prior.batch_shape,\n param.prior.event_shape]) for param in model.parameters]`. This may\n optionally also be a map (Python `dict`) of parameter names to\n `Tensor` values.\n Returns:\n component_dists: A `collections.OrderedDict` instance mapping\n component StructuralTimeSeries instances (elements of `model.components`)\n to `tfd.Distribution` instances representing the posterior marginal\n distributions on the process modeled by each component. Each distribution\n has batch shape matching that of `posterior_means`/`posterior_covs`, and\n event shape of `[num_timesteps]`.\n\n #### Examples\n\n Suppose we've built a model and fit it to data:\n\n ```python\n day_of_week = tfp.sts.Seasonal(\n num_seasons=7,\n observed_time_series=observed_time_series,\n name='day_of_week')\n local_linear_trend = tfp.sts.LocalLinearTrend(\n observed_time_series=observed_time_series,\n name='local_linear_trend')\n model = tfp.sts.Sum(components=[day_of_week, local_linear_trend],\n observed_time_series=observed_time_series)\n\n num_steps_forecast = 50\n samples, kernel_results = tfp.sts.fit_with_hmc(model, observed_time_series)\n ```\n\n To extract the contributions of individual components, pass the time series\n and sampled parameters into `decompose_by_component`:\n\n ```python\n component_dists = decompose_by_component(\n model,\n observed_time_series=observed_time_series,\n parameter_samples=samples)\n\n # Component mean and stddev have shape `[len(observed_time_series)]`.\n day_of_week_effect_mean = component_dists[day_of_week].mean()\n day_of_week_effect_stddev = component_dists[day_of_week].stddev()\n ```\n\n Using the component distributions, we can visualize the uncertainty for\n 
each component:\n\n ```\n from matplotlib import pylab as plt\n num_components = len(component_dists)\n xs = np.arange(len(observed_time_series))\n fig = plt.figure(figsize=(12, 3 * num_components))\n for i, (component, component_dist) in enumerate(component_dists.items()):\n\n # If in graph mode, replace `.numpy()` with `.eval()` or `sess.run()`.\n component_mean = component_dist.mean().numpy()\n component_stddev = component_dist.stddev().numpy()\n\n ax = fig.add_subplot(num_components, 1, 1 + i)\n ax.plot(xs, component_mean, lw=2)\n ax.fill_between(xs,\n component_mean - 2 * component_stddev,\n component_mean + 2 * component_stddev,\n alpha=0.5)\n ax.set_title(component.name)\n ```\n\n \"\"\"\n\n with tf.compat.v1.name_scope('decompose_by_component',\n values=[observed_time_series]):\n [\n observed_time_series,\n is_missing\n ] = sts_util.canonicalize_observed_time_series_with_mask(\n observed_time_series)\n\n # Run smoothing over the training timesteps to extract the\n # posterior on latents.\n num_timesteps = dist_util.prefer_static_value(\n tf.shape(input=observed_time_series))[-2]\n ssm = model.make_state_space_model(num_timesteps=num_timesteps,\n param_vals=parameter_samples)\n posterior_means, posterior_covs = ssm.posterior_marginals(\n observed_time_series, mask=is_missing)\n\n return _decompose_from_posterior_marginals(\n model, posterior_means, posterior_covs, parameter_samples)"} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef decompose_forecast_by_component(model, forecast_dist, parameter_samples):\n\n with tf.compat.v1.name_scope('decompose_forecast_by_component'):\n try:\n forecast_lgssm = forecast_dist.components_distribution\n forecast_latent_mean, _ = forecast_lgssm._joint_mean() # pylint: disable=protected-access\n forecast_latent_covs, _ = forecast_lgssm._joint_covariances() # pylint: disable=protected-access\n except AttributeError as e:\n raise ValueError(\n 'Forecast 
distribution must be a MixtureSameFamily of '\n 'LinearGaussianStateSpaceModel distributions, such as returned by '\n '`tfp.sts.forecast()`. (saw exception: {})'.format(e))\n\n # Since `parameter_samples` will have sample shape `[num_posterior_draws]`,\n # we need to move the `num_posterior_draws` dimension of the forecast\n # moments from the trailing batch dimension, where it's currently put by\n # `sts.forecast`, back to the leading (sample shape) dimension.\n forecast_latent_mean = dist_util.move_dimension(\n forecast_latent_mean, source_idx=-3, dest_idx=0)\n forecast_latent_covs = dist_util.move_dimension(\n forecast_latent_covs, source_idx=-4, dest_idx=0)\n\n return _decompose_from_posterior_marginals(\n model, forecast_latent_mean, forecast_latent_covs, parameter_samples)", "response": "Decompose a forecast distribution into contributions from each component."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\nconvert dense tensor x to SparseTensor.", "response": "def dense_to_sparse(x, ignore_value=None, name=None):\n \"\"\"Converts dense `Tensor` to `SparseTensor`, dropping `ignore_value` cells.\n\n Args:\n x: A `Tensor`.\n ignore_value: Entries in `x` equal to this value will be\n absent from the return `SparseTensor`. If `None`, default value of\n `x` dtype will be used (e.g. 
'' for `str`, 0 for `int`).\n name: Python `str` prefix for ops created by this function.\n\n Returns:\n sparse_x: A `tf.SparseTensor` with the same shape as `x`.\n\n Raises:\n ValueError: when `x`'s rank is `None`.\n \"\"\"\n # Copied (with modifications) from:\n # tensorflow/contrib/layers/python/ops/sparse_ops.py.\n with tf.compat.v1.name_scope(name, 'dense_to_sparse', [x, ignore_value]):\n x = tf.convert_to_tensor(value=x, name='x')\n if ignore_value is None:\n if x.dtype.base_dtype == tf.string:\n # Exception: TF strings are converted to numpy objects by default.\n ignore_value = ''\n else:\n ignore_value = x.dtype.as_numpy_dtype(0)\n ignore_value = tf.cast(ignore_value, x.dtype, name='ignore_value')\n indices = tf.where(tf.not_equal(x, ignore_value), name='indices')\n return tf.SparseTensor(\n indices=indices,\n values=tf.gather_nd(x, indices, name='values'),\n dense_shape=tf.shape(input=x, out_type=tf.int64, name='dense_shape'))"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef _operator(attr):\n @functools.wraps(attr)\n def func(a, *args):\n return attr(a.value, *args)\n return func", "response": "Defers an operator overload to attr.\n\n"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef sample_shape(self):\n if isinstance(self._sample_shape, tf.Tensor):\n return tf.TensorShape(tf.get_static_value(self._sample_shape))\n return tf.TensorShape(self._sample_shape)", "response": "Sample shape of random variable as a TensorShape."} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\nreturns the sample_shape of the random variable as a 1-D Tensor.", "response": "def sample_shape_tensor(self, name=\"sample_shape_tensor\"):\n \"\"\"Sample shape of random variable as a 1-D `Tensor`.\n\n Args:\n name: name to give to the op\n\n Returns:\n sample_shape: `Tensor`.\n \"\"\"\n with 
tf.compat.v1.name_scope(name):\n if isinstance(self._sample_shape, tf.Tensor):\n return self._sample_shape\n return tf.convert_to_tensor(\n value=self.sample_shape.as_list(), dtype=tf.int32)"} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\ngets tensor that the random variable corresponds to.", "response": "def value(self):\n \"\"\"Get tensor that the random variable corresponds to.\"\"\"\n if self._value is None:\n try:\n self._value = self.distribution.sample(self.sample_shape_tensor())\n except NotImplementedError:\n raise NotImplementedError(\n \"sample is not implemented for {0}. You must either pass in the \"\n \"value argument or implement sample for {0}.\"\n .format(self.distribution.__class__.__name__))\n return self._value"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef numpy(self):\n if not isinstance(self.value, ops.EagerTensor):\n raise NotImplementedError(\"value argument must be a EagerTensor.\")\n\n return self.value.numpy()", "response": "Value as NumPy array only available for TF Eager."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\nbuilding a scale-and-shift function using a multi-layer neural network. This will be wrapped in a make_template to ensure the variables are only created once. It takes the `d`-dimensional input x[0:d] and returns the `D-d` dimensional outputs `loc` (\"mu\") and `log_scale` (\"alpha\"). The default template does not support conditioning and will raise an exception if `condition_kwargs` are passed to it. To use conditioning in real nvp bijector, implement a conditioned shift/scale template that handles the `condition_kwargs`. Arguments: hidden_layers: Python `list`-like of non-negative integer, scalars indicating the number of units in each hidden layer. Default: `[512, 512]. shift_only: Python `bool` indicating if only the `shift` term shall be computed (i.e. NICE bijector). 
Default: `False`. activation: Activation function (callable). Explicitly setting to `None` implies a linear activation. name: A name for ops managed by this function. Default: \"real_nvp_default_template\". *args: `tf.layers.dense` arguments. **kwargs: `tf.layers.dense` keyword arguments. Returns: shift: `Float`-like `Tensor` of shift terms (\"mu\" in [Papamakarios et al. (2016)][1]). log_scale: `Float`-like `Tensor` of log(scale) terms (\"alpha\" in [Papamakarios et al. (2016)][1]). Raises: NotImplementedError: if rightmost dimension of `inputs` is unknown prior to graph execution, or if `condition_kwargs` is not empty. #### References [1]: George Papamakarios, Theo Pavlakou, and Iain Murray. Masked Autoregressive Flow for Density Estimation. In _Neural Information Processing Systems_, 2017. https://arxiv.org/abs/1705.07057", "response": "def real_nvp_default_template(hidden_layers,\n shift_only=False,\n activation=tf.nn.relu,\n name=None,\n *args, # pylint: disable=keyword-arg-before-vararg\n **kwargs):\n \"\"\"Build a scale-and-shift function using a multi-layer neural network.\n\n This will be wrapped in a make_template to ensure the variables are only\n created once. It takes the `d`-dimensional input x[0:d] and returns the `D-d`\n dimensional outputs `loc` (\"mu\") and `log_scale` (\"alpha\").\n\n The default template does not support conditioning and will raise an\n exception if `condition_kwargs` are passed to it. To use conditioning in\n real nvp bijector, implement a conditioned shift/scale template that\n handles the `condition_kwargs`.\n\n Arguments:\n hidden_layers: Python `list`-like of non-negative integer, scalars\n indicating the number of units in each hidden layer. Default: `[512, 512]`.\n shift_only: Python `bool` indicating if only the `shift` term shall be\n computed (i.e. NICE bijector). Default: `False`.\n activation: Activation function (callable). 
Explicitly setting to `None`\n implies a linear activation.\n name: A name for ops managed by this function. Default:\n \"real_nvp_default_template\".\n *args: `tf.layers.dense` arguments.\n **kwargs: `tf.layers.dense` keyword arguments.\n\n Returns:\n shift: `Float`-like `Tensor` of shift terms (\"mu\" in\n [Papamakarios et al. (2016)][1]).\n log_scale: `Float`-like `Tensor` of log(scale) terms (\"alpha\" in\n [Papamakarios et al. (2016)][1]).\n\n Raises:\n NotImplementedError: if rightmost dimension of `inputs` is unknown prior to\n graph execution, or if `condition_kwargs` is not empty.\n\n #### References\n\n [1]: George Papamakarios, Theo Pavlakou, and Iain Murray. Masked\n Autoregressive Flow for Density Estimation. In _Neural Information\n Processing Systems_, 2017. https://arxiv.org/abs/1705.07057\n \"\"\"\n\n with tf.compat.v2.name_scope(name or \"real_nvp_default_template\"):\n\n def _fn(x, output_units, **condition_kwargs):\n \"\"\"Fully connected MLP parameterized via `real_nvp_template`.\"\"\"\n if condition_kwargs:\n raise NotImplementedError(\n \"Conditioning not implemented in the default template.\")\n\n if tensorshape_util.rank(x.shape) == 1:\n x = x[tf.newaxis, ...]\n reshape_output = lambda x: x[0]\n else:\n reshape_output = lambda x: x\n for units in hidden_layers:\n x = tf.compat.v1.layers.dense(\n inputs=x,\n units=units,\n activation=activation,\n *args, # pylint: disable=keyword-arg-before-vararg\n **kwargs)\n x = tf.compat.v1.layers.dense(\n inputs=x,\n units=(1 if shift_only else 2) * output_units,\n activation=None,\n *args, # pylint: disable=keyword-arg-before-vararg\n **kwargs)\n if shift_only:\n return reshape_output(x), None\n shift, log_scale = tf.split(x, 2, axis=-1)\n return reshape_output(shift), reshape_output(log_scale)\n\n return tf.compat.v1.make_template(\"real_nvp_default_template\", _fn)"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _uniform_unit_norm(dimension, shape, 
dtype, seed):\n # This works because the Gaussian distribution is spherically symmetric.\n # raw shape: shape + [dimension]\n raw = normal.Normal(\n loc=dtype_util.as_numpy_dtype(dtype)(0),\n scale=dtype_util.as_numpy_dtype(dtype)(1)).sample(\n tf.concat([shape, [dimension]], axis=0), seed=seed())\n unit_norm = raw / tf.norm(tensor=raw, ord=2, axis=-1)[..., tf.newaxis]\n return unit_norm", "response": "Returns a batch of points chosen uniformly from the unit hypersphere."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\nreplicate the input tensor n times along a new major dimension.", "response": "def _replicate(n, tensor):\n \"\"\"Replicate the input tensor n times along a new (major) dimension.\"\"\"\n # TODO(axch) Does this already exist somewhere? Should it get contributed?\n multiples = tf.concat([[n], tf.ones_like(tensor.shape)], axis=0)\n return tf.tile(tf.expand_dims(tensor, axis=0), multiples)"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _sample_n(self, num_samples, seed=None, name=None):\n if self.dimension < 0:\n raise ValueError(\n 'Cannot sample negative-dimension correlation matrices.')\n # Notation below: B is the batch shape, i.e., tf.shape(concentration)\n seed = seed_stream.SeedStream(seed, 'sample_lkj')\n with tf.name_scope('sample_lkj' or name):\n if not dtype_util.is_floating(self.concentration.dtype):\n raise TypeError(\n 'The concentration argument should have floating type, not '\n '{}'.format(dtype_util.name(self.concentration.dtype)))\n\n concentration = _replicate(num_samples, self.concentration)\n concentration_shape = tf.shape(input=concentration)\n if self.dimension <= 1:\n # For any dimension <= 1, there is only one possible correlation matrix.\n shape = tf.concat([\n concentration_shape, [self.dimension, self.dimension]], axis=0)\n return tf.ones(shape=shape, dtype=self.concentration.dtype)\n beta_conc = concentration + (self.dimension - 
2.) / 2.\n beta_dist = beta.Beta(concentration1=beta_conc, concentration0=beta_conc)\n\n # Note that the sampler below deviates from [1], by doing the sampling in\n # cholesky space. This does not change the fundamental logic of the\n # sampler, but does speed up the sampling.\n\n # This is the correlation coefficient between the first two dimensions.\n # This is also `r` in reference [1].\n corr12 = 2. * beta_dist.sample(seed=seed()) - 1.\n\n # Below we construct the Cholesky of the initial 2x2 correlation matrix,\n # which is of the form:\n # [[1, 0], [r, sqrt(1 - r**2)]], where r is the correlation between the\n # first two dimensions.\n # This is the top-left corner of the cholesky of the final sample.\n first_row = tf.concat([\n tf.ones_like(corr12)[..., tf.newaxis],\n tf.zeros_like(corr12)[..., tf.newaxis]], axis=-1)\n second_row = tf.concat([\n corr12[..., tf.newaxis],\n tf.sqrt(1 - corr12**2)[..., tf.newaxis]], axis=-1)\n\n chol_result = tf.concat([\n first_row[..., tf.newaxis, :],\n second_row[..., tf.newaxis, :]], axis=-2)\n\n for n in range(2, self.dimension):\n # Loop invariant: on entry, result has shape B + [n, n]\n beta_conc -= 0.5\n # norm is y in reference [1].\n norm = beta.Beta(\n concentration1=n/2.,\n concentration0=beta_conc\n ).sample(seed=seed())\n # distance shape: B + [1] for broadcast\n distance = tf.sqrt(norm)[..., tf.newaxis]\n # direction is u in reference [1].\n # direction shape: B + [n]\n direction = _uniform_unit_norm(\n n, concentration_shape, self.concentration.dtype, seed)\n # raw_correlation is w in reference [1].\n raw_correlation = distance * direction # shape: B + [n]\n\n # This is the next row in the cholesky of the result,\n # which differs from the construction in reference [1].\n # In the reference, the new row `z` = chol_result @ raw_correlation^T\n # = C @ raw_correlation^T (where as short hand we use C = chol_result).\n # We prove that the below equation is the right row to add to the\n # cholesky, by showing equality 
with reference [1].\n # Let S be the sample constructed so far, and let `z` be as in\n # reference [1]. Then at this iteration, the new sample S' will be\n # [[S z^T]\n # [z 1]]\n # In our case we have the cholesky decomposition factor C, so\n # we want our new row x (same size as z) to satisfy:\n # [[S z^T] [[C 0] [[C^T x^T] [[CC^T Cx^T]\n # [z 1]] = [x k]] [0 k]] = [xC^t xx^T + k**2]]\n # Since C @ raw_correlation^T = z = C @ x^T, and C is invertible,\n # we have that x = raw_correlation. Also 1 = xx^T + k**2, so k\n # = sqrt(1 - xx^T) = sqrt(1 - |raw_correlation|**2) = sqrt(1 -\n # distance**2).\n new_row = tf.concat(\n [raw_correlation, tf.sqrt(1. - norm[..., tf.newaxis])], axis=-1)\n\n # Finally add this new row, by growing the cholesky of the result.\n chol_result = tf.concat([\n chol_result,\n tf.zeros_like(chol_result[..., 0][..., tf.newaxis])], axis=-1)\n\n chol_result = tf.concat(\n [chol_result, new_row[..., tf.newaxis, :]], axis=-2)\n\n if self.input_output_cholesky:\n return chol_result\n\n result = tf.matmul(chol_result, chol_result, transpose_b=True)\n # The diagonal for a correlation matrix should always be ones. Due to\n # numerical instability the matmul might not achieve that, so manually set\n # these to ones.\n result = tf.linalg.set_diag(\n result,\n tf.ones(shape=tf.shape(input=result)[:-1], dtype=result.dtype))\n # This sampling algorithm can produce near-PSD matrices on which standard\n # algorithms such as `tf.cholesky` or `tf.linalg.self_adjoint_eigvals`\n # fail. 
Specifically, as documented in b/116828694, around 2% of trials\n # of 900,000 5x5 matrices (distributed according to 9 different\n # concentration parameter values) contained at least one matrix on which\n # the Cholesky decomposition failed.\n return result", "response": "Returns a Tensor of samples from an LKJ distribution."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _log_unnorm_prob(self, x, name=None):\n with tf.name_scope(name or 'log_unnorm_prob_lkj'):\n x = tf.convert_to_tensor(value=x, name='x')\n # The density is det(matrix) ** (concentration - 1).\n # Computing the determinant with `logdet` is usually fine, since\n # correlation matrices are Hermitian and PSD. But in some cases, for a\n # PSD matrix whose eigenvalues are close to zero, `logdet` raises an error\n # complaining that it is not PSD. The root cause is the computation of the\n # cholesky decomposition in `logdet`. Hence, we use the less efficient but\n # more robust `slogdet` which does not use `cholesky`.\n #\n # An alternative would have been to check allow_nan_stats and use\n # eigenvalues = tf.linalg.self_adjoint_eigvals(x)\n # psd_mask = tf.cast(\n # tf.reduce_min(eigenvalues, axis=-1) >= 0, dtype=x.dtype)\n # tf.where(psd_mask, answer, float('-inf'))\n # to emit probability 0 for inputs that are not PSD, without ever raising\n # an error. More care must be taken, as due to numerical stability issues,\n # self_adjoint_eigvals can return slightly negative eigenvalues even for\n # a PSD matrix.\n if self.input_output_cholesky:\n logdet = 2.0 * tf.reduce_sum(\n input_tensor=tf.math.log(tf.linalg.diag_part(x)), axis=[-1])\n else:\n _, logdet = tf.linalg.slogdet(x)\n answer = (self.concentration - 1.) 
* logdet\n return answer", "response": "Returns the unnormalized log density of a LKJ distribution."} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\nreturn the log normalization of an LKJ distribution.", "response": "def _log_normalization(self, name='log_normalization'):\n \"\"\"Returns the log normalization of an LKJ distribution.\n\n Args:\n name: Python `str` name prefixed to Ops created by this function.\n\n Returns:\n log_z: A Tensor of the same shape and dtype as `concentration`, containing\n the corresponding log normalizers.\n \"\"\"\n # The formula is from D. Lewandowski et al [1], p. 1999, from the\n # proof that eqs 16 and 17 are equivalent.\n with tf.name_scope(name or 'log_normalization_lkj'):\n logpi = np.log(np.pi)\n ans = tf.zeros_like(self.concentration)\n for k in range(1, self.dimension):\n ans += logpi * (k / 2.)\n ans += tf.math.lgamma(self.concentration +\n (self.dimension - 1 - k) / 2.)\n ans -= tf.math.lgamma(self.concentration + (self.dimension - 1) / 2.)\n return ans"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef common_dtype(args_list, preferred_dtype=None):\n dtype = None\n preferred_dtype = (None if preferred_dtype is None\n else tf.as_dtype(preferred_dtype))\n for a in tf.nest.flatten(args_list):\n if hasattr(a, 'dtype'):\n dt = tf.as_dtype(a.dtype)\n else:\n continue\n if dtype is None:\n dtype = dt\n elif dtype != dt:\n raise TypeError('Found incompatible dtypes, {} and {}.'.format(dtype, dt))\n if dtype is None and preferred_dtype is None:\n return None\n return (preferred_dtype if dtype is None else dtype).as_numpy_dtype", "response": "Returns explict dtype from args_list if exists else preferred_dtype."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _make_summary_statistic(attr):\n def _fn(self, **kwargs):\n \"\"\"Implements summary statistic, eg, mean, 
stddev, mode.\"\"\"\n x = getattr(self.distribution, attr)(**kwargs)\n shape = prefer_static.concat([\n self.distribution.batch_shape_tensor(),\n prefer_static.ones(prefer_static.rank_from_shape(self.sample_shape),\n dtype=self.sample_shape.dtype),\n self.distribution.event_shape_tensor(),\n ], axis=0)\n x = tf.reshape(x, shape=shape)\n shape = prefer_static.concat([\n self.distribution.batch_shape_tensor(),\n self.sample_shape,\n self.distribution.event_shape_tensor(),\n ], axis=0)\n return tf.broadcast_to(x, shape)\n return _fn", "response": "Factory for implementing summary statistics eg mean stddev mode."} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef _kl_sample(a, b, name='kl_sample'):\n assertions = []\n a_ss = tf.get_static_value(a.sample_shape)\n b_ss = tf.get_static_value(b.sample_shape)\n msg = '`a.sample_shape` must be identical to `b.sample_shape`.'\n if a_ss is not None and b_ss is not None:\n if not np.array_equal(a_ss, b_ss):\n raise ValueError(msg)\n elif a.validate_args or b.validate_args:\n assertions.append(assert_util.assert_equal(\n a.sample_shape, b.sample_shape, message=msg))\n with tf.control_dependencies(assertions):\n kl = kullback_leibler.kl_divergence(\n a.distribution, b.distribution, name=name)\n n = prefer_static.reduce_prod(a.sample_shape)\n return tf.cast(x=n, dtype=kl.dtype) * kl", "response": "Batchwise KL divergence of a and b."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _broadcast_to(tensor_to_broadcast, target_tensors):\n output = tensor_to_broadcast\n for tensor in target_tensors:\n output += tf.zeros_like(tensor)\n return output", "response": "Helper to broadcast a tensor to a list of target tensors."} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\nevaluating the pdf at the peak.", "response": "def _pdf_at_peak(self):\n \"\"\"Pdf evaluated at the peak.\"\"\"\n return 
(self.peak - self.low) / (self.high - self.low)"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef effective_sample_size(states,\n filter_threshold=0.,\n filter_beyond_lag=None,\n name=None):\n \"\"\"Estimate a lower bound on effective sample size for each independent chain.\n\n Roughly speaking, \"effective sample size\" (ESS) is the size of an iid sample\n with the same variance as `state`.\n\n More precisely, given a stationary sequence of possibly correlated random\n variables `X_1, X_2,...,X_N`, each identically distributed ESS is the number\n such that\n\n ```Variance{ N**-1 * Sum{X_i} } = ESS**-1 * Variance{ X_1 }.```\n\n If the sequence is uncorrelated, `ESS = N`. In general, one should expect\n `ESS <= N`, with more highly correlated sequences having smaller `ESS`.\n\n Args:\n states: `Tensor` or list of `Tensor` objects. Dimension zero should index\n identically distributed states.\n filter_threshold: `Tensor` or list of `Tensor` objects.\n Must broadcast with `state`. The auto-correlation sequence is truncated\n after the first appearance of a term less than `filter_threshold`.\n Setting to `None` means we use no threshold filter. Since `|R_k| <= 1`,\n setting to any number less than `-1` has the same effect.\n filter_beyond_lag: `Tensor` or list of `Tensor` objects. Must be\n `int`-like and scalar valued. The auto-correlation sequence is truncated\n to this length. Setting to `None` means we do not filter based on number\n of lags.\n name: `String` name to prepend to created ops.\n\n Returns:\n ess: `Tensor` or list of `Tensor` objects. The effective sample size of\n each component of `states`. 
Shape will be `states.shape[1:]`.\n\n Raises:\n ValueError: If `states` and `filter_threshold` or `states` and\n `filter_beyond_lag` are both lists with different lengths.\n\n #### Examples\n\n We use ESS to estimate standard error.\n\n ```\n import tensorflow as tf\n import tensorflow_probability as tfp\n tfd = tfp.distributions\n\n target = tfd.MultivariateNormalDiag(scale_diag=[1., 2.])\n\n # Get 1000 states from one chain.\n states = tfp.mcmc.sample_chain(\n num_burnin_steps=200,\n num_results=1000,\n current_state=tf.constant([0., 0.]),\n kernel=tfp.mcmc.HamiltonianMonteCarlo(\n target_log_prob_fn=target.log_prob,\n step_size=0.05,\n num_leapfrog_steps=20))\n states.shape\n ==> (1000, 2)\n\n ess = effective_sample_size(states)\n ==> Shape (2,) Tensor\n\n mean, variance = tf.nn.moments(states, axis=0)\n standard_error = tf.sqrt(variance / ess)\n ```\n\n Some math shows that, with `R_k` the auto-correlation sequence,\n `R_k := Covariance{X_1, X_{1+k}} / Variance{X_1}`, we have\n\n ```ESS(N) = N / [ 1 + 2 * ( (N - 1) / N * R_1 + ... + 1 / N * R_{N-1} ) ]```\n\n This function estimates the above by first estimating the auto-correlation.\n Since `R_k` must be estimated using only `N - k` samples, it becomes\n progressively noisier for larger `k`. For this reason, the summation over\n `R_k` should be truncated at some number `filter_beyond_lag < N`. Since many\n MCMC methods generate chains where `R_k > 0`, a reasonable criteria is to\n truncate at the first index where the estimated auto-correlation becomes\n negative.\n\n The arguments `filter_beyond_lag`, `filter_threshold` are filters intended to\n remove noisy tail terms from `R_k`. 
They combine in an \"OR\" manner meaning\n terms are removed if they were to be filtered under the `filter_beyond_lag` OR\n `filter_threshold` criteria.\n \"\"\"\n states_was_list = _is_list_like(states)\n\n # Convert all args to lists.\n if not states_was_list:\n states = [states]\n\n filter_beyond_lag = _broadcast_maybelist_arg(states, filter_beyond_lag,\n 'filter_beyond_lag')\n filter_threshold = _broadcast_maybelist_arg(states, filter_threshold,\n 'filter_threshold')\n\n # Process items, one at a time.\n with tf.compat.v1.name_scope(name, 'effective_sample_size'):\n ess_list = [\n _effective_sample_size_single_state(s, ml, mlt)\n for (s, ml, mlt) in zip(states, filter_beyond_lag, filter_threshold)\n ]\n\n if states_was_list:\n return ess_list\n return ess_list[0]", "response": "Estimate a lower bound on effective sample size for each independent chain."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef potential_scale_reduction(chains_states,\n independent_chain_ndims=1,\n name=None):\n \"\"\"Gelman and Rubin (1992)'s potential scale reduction for chain convergence.\n\n Given `N > 1` states from each of `C > 1` independent chains, the potential\n scale reduction factor, commonly referred to as R-hat, measures convergence of\n the chains (to the same target) by testing for equality of means.\n Specifically, R-hat measures the degree to which variance (of the means)\n between chains exceeds what one would expect if the chains were identically\n distributed. See [Gelman and Rubin (1992)][1]; [Brooks and Gelman (1998)][2].\n\n Some guidelines:\n\n * The initial state of the chains should be drawn from a distribution\n overdispersed with respect to the target.\n * If all chains converge to the target, then as `N --> infinity`, R-hat --> 1.\n Before that, R-hat > 1 (except in pathological cases, e.g. if the chain\n paths were identical).\n * The above holds for any number of chains `C > 1`. 
Increasing `C` improves the\n effectiveness of the diagnostic.\n * Sometimes, R-hat < 1.2 is used to indicate approximate convergence, but of\n course this is problem dependent. See [Brooks and Gelman (1998)][2].\n * R-hat only measures non-convergence of the mean. If higher moments, or\n other statistics are desired, a different diagnostic should be used. See\n [Brooks and Gelman (1998)][2].\n\n Args:\n chains_states: `Tensor` or Python `list` of `Tensor`s representing the\n state(s) of a Markov Chain at each result step. The `ith` state is\n assumed to have shape `[Ni, Ci1, Ci2,...,CiD] + A`.\n Dimension `0` indexes the `Ni > 1` result steps of the Markov Chain.\n Dimensions `1` through `D` index the `Ci1 x ... x CiD` independent\n chains to be tested for convergence to the same target.\n The remaining dimensions, `A`, can have any shape (even empty).\n independent_chain_ndims: Integer type `Tensor` with value `>= 1` giving the\n number of dimensions, from `dim = 1` to `dim = D`,\n holding independent chain results to be tested for convergence.\n name: `String` name to prepend to created tf ops. Default:\n `potential_scale_reduction`.\n\n Returns:\n `Tensor` or Python `list` of `Tensor`s representing the R-hat statistic for\n the state(s). 
Same `dtype` as `state`, and shape equal to\n `state.shape[1 + independent_chain_ndims:]`.\n\n Raises:\n ValueError: If `independent_chain_ndims < 1`.\n\n #### Examples\n\n Diagnosing convergence by monitoring 10 chains that each attempt to\n sample from a 2-variate normal.\n\n ```python\n import tensorflow as tf\n import tensorflow_probability as tfp\n tfd = tfp.distributions\n\n target = tfd.MultivariateNormalDiag(scale_diag=[1., 2.])\n\n # Get 10 (2x) overdispersed initial states.\n initial_state = target.sample(10) * 2.\n ==> (10, 2)\n\n # Get 1000 samples from the 10 independent chains.\n chains_states, _ = tfp.mcmc.sample_chain(\n num_burnin_steps=200,\n num_results=1000,\n current_state=initial_state,\n kernel=tfp.mcmc.HamiltonianMonteCarlo(\n target_log_prob_fn=target.log_prob,\n step_size=0.05,\n num_leapfrog_steps=20))\n chains_states.shape\n ==> (1000, 10, 2)\n\n rhat = tfp.mcmc.diagnostic.potential_scale_reduction(\n chains_states, independent_chain_ndims=1)\n\n # The second dimension needed a longer burn-in.\n rhat.eval()\n ==> [1.05, 1.3]\n ```\n\n To see why R-hat is reasonable, let `X` be a random variable drawn uniformly\n from the combined states (combined over all chains). Then, in the limit\n `N, C --> infinity`, with `E`, `Var` denoting expectation and variance,\n\n ```R-hat = ( E[Var[X | chain]] + Var[E[X | chain]] ) / E[Var[X | chain]].```\n\n Using the law of total variance, the numerator is the variance of the combined\n states, and the denominator is the total variance minus the variance of the\n the individual chain means. If the chains are all drawing from the same\n distribution, they will have the same mean, and thus the ratio should be one.\n\n #### References\n\n [1]: Stephen P. Brooks and Andrew Gelman. General Methods for Monitoring\n Convergence of Iterative Simulations. _Journal of Computational and\n Graphical Statistics_, 7(4), 1998.\n\n [2]: Andrew Gelman and Donald B. Rubin. 
Inference from Iterative Simulation\n Using Multiple Sequences. _Statistical Science_, 7(4):457-472, 1992.\n \"\"\"\n chains_states_was_list = _is_list_like(chains_states)\n if not chains_states_was_list:\n chains_states = [chains_states]\n\n # tf.get_static_value returns None iff a constant value (as a numpy\n # array) is not efficiently computable. Therefore, we try constant_value then\n # check for None.\n icn_const_ = tf.get_static_value(\n tf.convert_to_tensor(value=independent_chain_ndims))\n if icn_const_ is not None:\n independent_chain_ndims = icn_const_\n if icn_const_ < 1:\n raise ValueError(\n 'Argument `independent_chain_ndims` must be `>= 1`, found: {}'.format(\n independent_chain_ndims))\n\n with tf.compat.v1.name_scope(name, 'potential_scale_reduction'):\n rhat_list = [\n _potential_scale_reduction_single_state(s, independent_chain_ndims)\n for s in chains_states\n ]\n\n if chains_states_was_list:\n return rhat_list\n return rhat_list[0]", "response": "Gelman and Rubin (1992)'s potential scale reduction for chains."} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef _potential_scale_reduction_single_state(state, independent_chain_ndims):\n with tf.compat.v1.name_scope(\n 'potential_scale_reduction_single_state',\n values=[state, independent_chain_ndims]):\n # We assume exactly one leading dimension indexes e.g. 
correlated samples\n # from each Markov chain.\n state = tf.convert_to_tensor(value=state, name='state')\n sample_ndims = 1\n\n sample_axis = tf.range(0, sample_ndims)\n chain_axis = tf.range(sample_ndims,\n sample_ndims + independent_chain_ndims)\n sample_and_chain_axis = tf.range(\n 0, sample_ndims + independent_chain_ndims)\n\n n = _axis_size(state, sample_axis)\n m = _axis_size(state, chain_axis)\n\n # In the language of Brooks and Gelman (1998),\n # B / n is the between chain variance, the variance of the chain means.\n # W is the within sequence variance, the mean of the chain variances.\n b_div_n = _reduce_variance(\n tf.reduce_mean(input_tensor=state, axis=sample_axis, keepdims=True),\n sample_and_chain_axis,\n biased=False)\n w = tf.reduce_mean(\n input_tensor=_reduce_variance(\n state, sample_axis, keepdims=True, biased=True),\n axis=sample_and_chain_axis)\n\n # sigma^2_+ is an estimate of the true variance, which would be unbiased if\n # each chain was drawn from the target. c.f. \"law of total variance.\"\n sigma_2_plus = w + b_div_n\n\n return ((m + 1.) / m) * sigma_2_plus / w - (n - 1.) / (m * n)", "response": "Internal function for the potential scale reduction for one single state."} {"SOURCE": "codesearchnet", "instruction": "Can you create a Python 3 function that\ngets number of elements of x in axis as type x. 
dtype.", "response": "def _axis_size(x, axis=None):\n \"\"\"Get number of elements of `x` in `axis`, as type `x.dtype`.\"\"\"\n if axis is None:\n return tf.cast(tf.size(input=x), x.dtype)\n return tf.cast(\n tf.reduce_prod(input_tensor=tf.gather(tf.shape(input=x), axis)), x.dtype)"} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef _broadcast_maybelist_arg(states, secondary_arg, name):\n if _is_list_like(secondary_arg):\n if len(secondary_arg) != len(states):\n raise ValueError('Argument `{}` was a list of different length ({}) than '\n '`states` ({})'.format(name, len(secondary_arg), len(states)))\n else:\n secondary_arg = [secondary_arg] * len(states)\n\n return secondary_arg", "response": "Broadcast a maybe-list secondary_arg to match the length of states."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef quadrature_scheme_lognormal_gauss_hermite(\n loc, scale, quadrature_size,\n validate_args=False, name=None): # pylint: disable=unused-argument\n \"\"\"Use Gauss-Hermite quadrature to form quadrature on positive-reals.\n\n Note: for a given `quadrature_size`, this method is generally less accurate\n than `quadrature_scheme_lognormal_quantiles`.\n\n Args:\n loc: `float`-like (batch of) scalar `Tensor`; the location parameter of\n the LogNormal prior.\n scale: `float`-like (batch of) scalar `Tensor`; the scale parameter of\n the LogNormal prior.\n quadrature_size: Python `int` scalar representing the number of quadrature\n points.\n validate_args: Python `bool`, default `False`. When `True` distribution\n parameters are checked for validity despite possibly degrading runtime\n performance. 
When `False` invalid inputs may silently render incorrect\n outputs.\n name: Python `str` name prefixed to Ops created by this class.\n\n Returns:\n grid: (Batch of) length-`quadrature_size` vectors representing the\n `log_rate` parameters of a `Poisson`.\n probs: (Batch of) length-`quadrature_size` vectors representing the\n weight associate with each `grid` value.\n \"\"\"\n with tf.name_scope(\n name or \"vector_diffeomixture_quadrature_gauss_hermite\"):\n grid, probs = np.polynomial.hermite.hermgauss(deg=quadrature_size)\n npdt = dtype_util.as_numpy_dtype(loc.dtype)\n grid = grid.astype(npdt)\n probs = probs.astype(npdt)\n probs /= np.linalg.norm(probs, ord=1, keepdims=True)\n probs = tf.convert_to_tensor(value=probs, name=\"probs\", dtype=loc.dtype)\n # The following maps the broadcast of `loc` and `scale` to each grid\n # point, i.e., we are creating several log-rates that correspond to the\n # different Gauss-Hermite quadrature points and (possible) batches of\n # `loc` and `scale`.\n grid = (loc[..., tf.newaxis] + np.sqrt(2.) * scale[..., tf.newaxis] * grid)\n return grid, probs", "response": "Use Gauss - Hermite quadrature to form a lognormal graph on positive - reals."} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\nuses LogNormal quantiles to form quadrature on positive - reals.", "response": "def quadrature_scheme_lognormal_quantiles(\n loc, scale, quadrature_size,\n validate_args=False, name=None):\n \"\"\"Use LogNormal quantiles to form quadrature on positive-reals.\n\n Args:\n loc: `float`-like (batch of) scalar `Tensor`; the location parameter of\n the LogNormal prior.\n scale: `float`-like (batch of) scalar `Tensor`; the scale parameter of\n the LogNormal prior.\n quadrature_size: Python `int` scalar representing the number of quadrature\n points.\n validate_args: Python `bool`, default `False`. 
When `True` distribution\n parameters are checked for validity despite possibly degrading runtime\n performance. When `False` invalid inputs may silently render incorrect\n outputs.\n name: Python `str` name prefixed to Ops created by this class.\n\n Returns:\n grid: (Batch of) length-`quadrature_size` vectors representing the\n `log_rate` parameters of a `Poisson`.\n probs: (Batch of) length-`quadrature_size` vectors representing the\n weight associate with each `grid` value.\n \"\"\"\n with tf.name_scope(name or \"quadrature_scheme_lognormal_quantiles\"):\n # Create a LogNormal distribution.\n dist = transformed_distribution.TransformedDistribution(\n distribution=normal.Normal(loc=loc, scale=scale),\n bijector=exp_bijector.Exp(),\n validate_args=validate_args)\n batch_ndims = tensorshape_util.rank(dist.batch_shape)\n if batch_ndims is None:\n batch_ndims = tf.shape(input=dist.batch_shape_tensor())[0]\n\n def _compute_quantiles():\n \"\"\"Helper to build quantiles.\"\"\"\n # Omit {0, 1} since they might lead to Inf/NaN.\n zero = tf.zeros([], dtype=dist.dtype)\n edges = tf.linspace(zero, 1., quadrature_size + 3)[1:-1]\n # Expand edges so its broadcast across batch dims.\n edges = tf.reshape(\n edges,\n shape=tf.concat(\n [[-1], tf.ones([batch_ndims], dtype=tf.int32)], axis=0))\n quantiles = dist.quantile(edges)\n # Cyclically permute left by one.\n perm = tf.concat([tf.range(1, 1 + batch_ndims), [0]], axis=0)\n quantiles = tf.transpose(a=quantiles, perm=perm)\n return quantiles\n quantiles = _compute_quantiles()\n\n # Compute grid as quantile midpoints.\n grid = (quantiles[..., :-1] + quantiles[..., 1:]) / 2.\n # Set shape hints.\n new_shape = tensorshape_util.concatenate(dist.batch_shape,\n [quadrature_size])\n tensorshape_util.set_shape(grid, new_shape)\n\n # By construction probs is constant, i.e., `1 / quadrature_size`. 
This is\n # important, because non-constant probs leads to non-reparameterizable\n # samples.\n probs = tf.fill(\n dims=[quadrature_size], value=1. / tf.cast(quadrature_size, dist.dtype))\n\n return grid, probs"} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef merge(self, x=None, y=None, ildj=None, kwargs=None, mapping=None):\n if mapping is None:\n mapping = _Mapping(x=x, y=y, ildj=ildj, kwargs=kwargs)\n elif any(arg is not None for arg in [x, y, ildj, kwargs]):\n raise ValueError(\"Cannot simultaneously specify mapping and individual \"\n \"arguments.\")\n\n return _Mapping(\n x=self._merge(self.x, mapping.x),\n y=self._merge(self.y, mapping.y),\n ildj=self._merge(self.ildj, mapping.ildj),\n kwargs=self._merge(self.kwargs, mapping.kwargs, use_equals=True))", "response": "Returns a new _Mapping with args merged with self."} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\nremove the given field from the cache value.", "response": "def remove(self, field):\n \"\"\"To support weak referencing, removes cache key from the cache value.\"\"\"\n return _Mapping(\n x=None if field == \"x\" else self.x,\n y=None if field == \"y\" else self.y,\n ildj=self.ildj,\n kwargs=self.kwargs)"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _merge(self, old, new, use_equals=False):\n if old is None:\n return new\n if new is None:\n return old\n if (old == new) if use_equals else (old is new):\n return old\n raise ValueError(\"Incompatible values: %s != %s\" % (old, new))", "response": "Helper to merge two values."} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\nconverting nested tuple list or dict to nested tuple.", "response": "def _deep_tuple(self, x):\n \"\"\"Converts nested `tuple`, `list`, or `dict` to nested `tuple`.\"\"\"\n if isinstance(x, dict):\n return 
self._deep_tuple(tuple(sorted(x.items())))\n elif isinstance(x, (list, tuple)):\n return tuple(map(self._deep_tuple, x))\n\n return x"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _left_doubling_increments(batch_shape, max_doublings, step_size, seed=None,\n name=None):\n \"\"\"Computes the doubling increments for the left end point.\n\n The doubling procedure expands an initial interval to find a superset of the\n true slice. At each doubling iteration, the interval width is doubled to\n either the left or the right hand side with equal probability.\n If, initially, the left end point is at `L(0)` and the width of the\n interval is `w(0)`, then the left end point and the width at the\n k-th iteration (denoted L(k) and w(k) respectively) are given by the following\n recursions:\n\n ```none\n w(k) = 2 * w(k-1)\n L(k) = L(k-1) - w(k-1) * X_k, X_k ~ Bernoulli(0.5)\n or, L(0) - L(k) = w(0) Sum(2^i * X(i+1), 0 <= i < k)\n ```\n\n This function computes the sequence of `L(0)-L(k)` and `w(k)` for k between 0\n and `max_doublings` independently for each chain.\n\n Args:\n batch_shape: Positive int32 `tf.Tensor`. The batch shape.\n max_doublings: Scalar positive int32 `tf.Tensor`. The maximum number of\n doublings to consider.\n step_size: A real `tf.Tensor` with shape compatible with [num_chains].\n The size of the initial interval.\n seed: (Optional) positive int. The random seed. If None, no seed is set.\n name: Python `str` name prefixed to Ops created by this function.\n Default value: `None` (i.e., 'find_slice_bounds').\n\n Returns:\n left_increments: A tensor of shape (max_doublings+1, batch_shape). The\n relative position of the left end point after the doublings.\n widths: A tensor of shape (max_doublings+1, ones_like(batch_shape)). 
The\n widths of the intervals at each stage of the doubling.\n \"\"\"\n with tf.compat.v1.name_scope(name, 'left_doubling_increments',\n [batch_shape, max_doublings, step_size]):\n\n step_size = tf.convert_to_tensor(value=step_size)\n dtype = step_size.dtype.base_dtype\n # Output shape of the left increments tensor.\n output_shape = tf.concat(([max_doublings + 1], batch_shape), axis=0)\n # A sample realization of X_k.\n expand_left = distributions.Bernoulli(0.5, dtype=dtype).sample(\n sample_shape=output_shape, seed=seed)\n\n # The widths of the successive intervals. Starts with 1.0 and ends with\n # 2^max_doublings.\n width_multipliers = tf.cast(2 ** tf.range(0, max_doublings+1), dtype=dtype)\n # Output shape of the `widths` tensor.\n widths_shape = tf.concat(([max_doublings + 1],\n tf.ones_like(batch_shape)), axis=0)\n width_multipliers = tf.reshape(width_multipliers, shape=widths_shape)\n # Widths shape is [max_doublings + 1, 1, 1, 1...].\n widths = width_multipliers * step_size\n\n # Take the cumulative sum of the left side increments in slice width to give\n # the resulting distance from the inital lower bound.\n left_increments = tf.cumsum(widths * expand_left, exclusive=True, axis=0)\n return left_increments, widths", "response": "This function computes the doubling increments for the left end point."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _find_best_interval_idx(x, name=None):\n with tf.compat.v1.name_scope(name, 'find_best_interval_idx', [x]):\n # Returns max_doublings + 1. Positive int32.\n k = tf.shape(input=x)[0]\n dtype = x.dtype.base_dtype\n # Factors by which to multiply the flag. Corresponds to (2 * k - i) above.\n mults = tf.range(2 * k, k, -1, dtype=dtype)[:, tf.newaxis]\n # Factors by which to shift the flag. Corresponds to i above. 
Ensures the\n # widest bounds are selected if there are no bounds outside the slice.\n shifts = tf.range(k, dtype=dtype)[:, tf.newaxis]\n indices = tf.argmax(input=mults * x + shifts, axis=0, output_type=dtype)\n return indices", "response": "Internal function for finding the index of the optimal set of bounds for each chain."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef slice_bounds_by_doubling(x_initial,\n target_log_prob,\n log_slice_heights,\n max_doublings,\n step_size,\n seed=None,\n name=None):\n \"\"\"Returns the bounds of the slice at each stage of doubling procedure.\n\n Precomputes the x coordinates of the left (L) and right (R) endpoints of the\n interval `I` produced in the \"doubling\" algorithm [Neal 2003][1] P713. Note\n that we simultaneously compute all possible doubling values for each chain,\n for the reason that at small-medium densities, the gains from parallel\n evaluation might cause a speed-up, but this will be benchmarked against the\n while loop implementation.\n\n Args:\n x_initial: `tf.Tensor` of any shape and any real dtype consumable by\n `target_log_prob`. The initial points.\n target_log_prob: A callable taking a `tf.Tensor` of shape and dtype as\n `x_initial` and returning a tensor of the same shape. The log density of\n the target distribution.\n log_slice_heights: `tf.Tensor` with the same shape as `x_initial` and the\n same dtype as returned by `target_log_prob`. The log of the height of the\n slice for each chain. The values must be bounded above by\n `target_log_prob(x_initial)`.\n max_doublings: Scalar positive int32 `tf.Tensor`. The maximum number of\n doublings to consider.\n step_size: `tf.Tensor` with same dtype as and shape compatible with\n `x_initial`. The size of the initial interval.\n seed: (Optional) positive int. The random seed. 
If None, no seed is set.\n name: Python `str` name prefixed to Ops created by this function.\n Default value: `None` (i.e., 'find_slice_bounds').\n\n Returns:\n upper_bounds: A tensor of same shape and dtype as `x_initial`. Slice upper\n bounds for each chain.\n lower_bounds: A tensor of same shape and dtype as `x_initial`. Slice lower\n bounds for each chain.\n both_ok: A tensor of shape `x_initial` and boolean dtype. Indicates if both\n the chosen upper and lower bound lie outside of the slice.\n\n #### References\n\n [1]: Radford M. Neal. Slice Sampling. The Annals of Statistics. 2003, Vol 31,\n No. 3 , 705-767.\n https://projecteuclid.org/download/pdf_1/euclid.aos/1056562461\n \"\"\"\n with tf.compat.v1.name_scope(\n name, 'slice_bounds_by_doubling',\n [x_initial, log_slice_heights, max_doublings, step_size]):\n seed_gen = distributions.SeedStream(seed, salt='slice_bounds_by_doubling')\n x_initial = tf.convert_to_tensor(value=x_initial)\n batch_shape = tf.shape(input=x_initial)\n dtype = step_size.dtype.base_dtype\n left_endpoints = x_initial + step_size * tf.random.uniform(\n batch_shape, minval=-1.0, maxval=0.0, dtype=dtype, seed=seed_gen())\n\n # Compute the increments by which we need to step the upper and lower bounds\n # part of the doubling procedure.\n left_increments, widths = _left_doubling_increments(\n batch_shape, max_doublings, step_size, seed=seed_gen())\n # The left and right end points. 
Shape (max_doublings+1,) + batch_shape.\n left_endpoints -= left_increments\n right_endpoints = left_endpoints + widths\n\n # Test if these end points lie outside of the slice.\n # Checks if the end points of the slice are outside the graph of the pdf.\n left_ep_values = tf.map_fn(target_log_prob, left_endpoints)\n right_ep_values = tf.map_fn(target_log_prob, right_endpoints)\n left_ok = left_ep_values < log_slice_heights\n right_ok = right_ep_values < log_slice_heights\n both_ok = left_ok & right_ok\n\n both_ok_f = tf.reshape(both_ok, [max_doublings + 1, -1])\n\n best_interval_idx = _find_best_interval_idx(\n tf.cast(both_ok_f, dtype=tf.int32))\n\n # Formats the above index as required to use with gather_nd.\n point_index_gather = tf.stack(\n [best_interval_idx,\n tf.range(tf.size(input=best_interval_idx))],\n axis=1,\n name='point_index_gather')\n left_ep_f = tf.reshape(left_endpoints, [max_doublings + 1, -1])\n right_ep_f = tf.reshape(right_endpoints, [max_doublings + 1, -1])\n # The x values of the uppper and lower bounds of the slices for each chain.\n lower_bounds = tf.reshape(tf.gather_nd(left_ep_f, point_index_gather),\n batch_shape)\n upper_bounds = tf.reshape(tf.gather_nd(right_ep_f, point_index_gather),\n batch_shape)\n both_ok = tf.reduce_any(input_tensor=both_ok, axis=0)\n return upper_bounds, lower_bounds, both_ok", "response": "Returns the upper and lower bounds of a slice at each stage of doubling procedure."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _sample_with_shrinkage(x_initial, target_log_prob, log_slice_heights,\n step_size, lower_bounds, upper_bounds, seed=None,\n name=None):\n \"\"\"Samples from the slice by applying shrinkage for rejected points.\n\n Implements the one dimensional slice sampling algorithm of Neal (2003), with a\n doubling algorithm (Neal 2003 P715 Fig. 4), which doubles the size of the\n interval at each iteration and shrinkage (Neal 2003 P716 Fig. 
5), which\n reduces the width of the slice when a selected point is rejected, by setting\n the relevant bound to that value. Randomly sampled points are checked for\n two criteria: that they lie within the slice and that they pass the\n acceptability check (Neal 2003 P717 Fig. 6), which tests that the new state\n could have generated the previous one.\n\n Args:\n x_initial: A tensor of any shape. The initial positions of the chains. This\n function assumes that all the dimensions of `x_initial` are batch\n dimensions (i.e. the event shape is `[]`).\n target_log_prob: Callable accepting a tensor like `x_initial` and returning\n a tensor containing the log density at that point of the same shape.\n log_slice_heights: Tensor of the same shape and dtype as the return value\n of `target_log_prob` when applied to `x_initial`. The log of the height of\n the chosen slice.\n step_size: A tensor of shape and dtype compatible with `x_initial`. The min\n interval size in the doubling algorithm.\n lower_bounds: Tensor of same shape and dtype as `x_initial`. Slice lower\n bounds for each chain.\n upper_bounds: Tensor of same shape and dtype as `x_initial`. Slice upper\n bounds for each chain.\n seed: (Optional) positive int. The random seed. If None, no seed is set.\n name: Python `str` name prefixed to Ops created by this function.\n Default value: `None` (i.e., 'find_slice_bounds').\n\n Returns:\n x_proposed: A tensor of the same shape and dtype as `x_initial`. 
The next\n proposed state of the chain.\n \"\"\"\n with tf.compat.v1.name_scope(\n name, 'sample_with_shrinkage',\n [x_initial, log_slice_heights, step_size, lower_bounds, upper_bounds]):\n seed_gen = distributions.SeedStream(seed, salt='_sample_with_shrinkage')\n # Keeps track of whether an acceptable sample has been found for the chain.\n found = tf.zeros_like(x_initial, dtype=tf.bool)\n cond = lambda found, *ignored_args: ~tf.reduce_all(input_tensor=found)\n x_next = tf.identity(x_initial)\n x_initial_shape = tf.shape(input=x_initial)\n x_initial_dtype = x_initial.dtype.base_dtype\n def _body(found, left, right, x_next):\n \"\"\"Iterates until every chain has found a suitable next state.\"\"\"\n proportions = tf.random.uniform(\n x_initial_shape, dtype=x_initial_dtype, seed=seed_gen())\n x_proposed = tf.where(~found, left + proportions * (right - left), x_next)\n accept_res = _test_acceptance(x_initial, target_log_prob=target_log_prob,\n decided=found,\n log_slice_heights=log_slice_heights,\n x_proposed=x_proposed, step_size=step_size,\n lower_bounds=left, upper_bounds=right)\n boundary_test = log_slice_heights < target_log_prob(x_proposed)\n can_accept = boundary_test & accept_res\n next_found = found | can_accept\n # Note that it might seem that we are moving the left and right end points\n # even if the point has been accepted (which is contrary to the stated\n # algorithm in Neal). 
However, this does not matter because the endpoints\n # for points that have been already accepted are not used again so it\n # doesn't matter what we do with them.\n next_left = tf.where(x_proposed < x_initial, x_proposed, left)\n next_right = tf.where(x_proposed >= x_initial, x_proposed, right)\n return next_found, next_left, next_right, x_proposed\n\n return tf.while_loop(\n cond=cond,\n body=_body,\n loop_vars=(found, lower_bounds, upper_bounds, x_next))[-1]", "response": "This function is a helper function for the one dimensional slice sampling algorithm of Neal 2003."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef sample_annealed_importance_chain(\n num_steps,\n proposal_log_prob_fn,\n target_log_prob_fn,\n current_state,\n make_kernel_fn,\n parallel_iterations=10,\n name=None):\n \"\"\"Runs annealed importance sampling (AIS) to estimate normalizing constants.\n\n This function uses an MCMC transition operator (e.g., Hamiltonian Monte Carlo)\n to sample from a series of distributions that slowly interpolates between\n an initial \"proposal\" distribution:\n\n `exp(proposal_log_prob_fn(x) - proposal_log_normalizer)`\n\n and the target distribution:\n\n `exp(target_log_prob_fn(x) - target_log_normalizer)`,\n\n accumulating importance weights along the way. The product of these\n importance weights gives an unbiased estimate of the ratio of the\n normalizing constants of the initial distribution and the target\n distribution:\n\n `E[exp(ais_weights)] = exp(target_log_normalizer - proposal_log_normalizer)`.\n\n Note: When running in graph mode, `proposal_log_prob_fn` and\n `target_log_prob_fn` are called exactly three times (although this may be\n reduced to two times in the future).\n\n Args:\n num_steps: Integer number of Markov chain updates to run. 
More\n iterations means more expense, but smoother annealing between q\n and p, which in turn means exponentially lower variance for the\n normalizing constant estimator.\n proposal_log_prob_fn: Python callable that returns the log density of the\n initial distribution.\n target_log_prob_fn: Python callable which takes an argument like\n `current_state` (or `*current_state` if it's a list) and returns its\n (possibly unnormalized) log-density under the target distribution.\n current_state: `Tensor` or Python `list` of `Tensor`s representing the\n current state(s) of the Markov chain(s). The first `r` dimensions index\n independent chains, `r = tf.rank(target_log_prob_fn(*current_state))`.\n make_kernel_fn: Python `callable` which returns a `TransitionKernel`-like\n object. Must take one argument representing the `TransitionKernel`'s\n `target_log_prob_fn`. The `target_log_prob_fn` argument represents the\n `TransitionKernel`'s target log distribution. Note:\n `sample_annealed_importance_chain` creates a new `target_log_prob_fn`\n which is an interpolation between the supplied `target_log_prob_fn` and\n `proposal_log_prob_fn`; it is this interpolated function which is used as\n an argument to `make_kernel_fn`.\n parallel_iterations: The number of iterations allowed to run in parallel.\n It must be a positive integer. See `tf.while_loop` for more details.\n name: Python `str` name prefixed to Ops created by this function.\n Default value: `None` (i.e., \"sample_annealed_importance_chain\").\n\n Returns:\n next_state: `Tensor` or Python list of `Tensor`s representing the\n state(s) of the Markov chain(s) at the final iteration. Has same shape as\n input `current_state`.\n ais_weights: Tensor with the estimated weight(s). 
Has shape matching\n `target_log_prob_fn(current_state)`.\n kernel_results: `collections.namedtuple` of internal calculations used to\n advance the chain.\n\n #### Examples\n\n ##### Estimate the normalizing constant of a log-gamma distribution.\n\n ```python\n tfd = tfp.distributions\n\n # Run 100 AIS chains in parallel\n num_chains = 100\n dims = 20\n dtype = np.float32\n\n proposal = tfd.MultivariateNormalDiag(\n loc=tf.zeros([dims], dtype=dtype))\n\n target = tfd.TransformedDistribution(\n distribution=tfd.Gamma(concentration=dtype(2),\n rate=dtype(3)),\n bijector=tfp.bijectors.Invert(tfp.bijectors.Exp()),\n event_shape=[dims])\n\n chains_state, ais_weights, kernel_results = (\n tfp.mcmc.sample_annealed_importance_chain(\n num_steps=1000,\n proposal_log_prob_fn=proposal.log_prob,\n target_log_prob_fn=target.log_prob,\n current_state=proposal.sample(num_chains),\n make_kernel_fn=lambda tlp_fn: tfp.mcmc.HamiltonianMonteCarlo(\n target_log_prob_fn=tlp_fn,\n step_size=0.2,\n num_leapfrog_steps=2)))\n\n log_estimated_normalizer = (tf.reduce_logsumexp(ais_weights)\n - np.log(num_chains))\n log_true_normalizer = tf.lgamma(2.) - 2. 
* tf.log(3.)\n ```\n\n ##### Estimate marginal likelihood of a Bayesian regression model.\n\n ```python\n tfd = tfp.distributions\n\n def make_prior(dims, dtype):\n return tfd.MultivariateNormalDiag(\n loc=tf.zeros(dims, dtype))\n\n def make_likelihood(weights, x):\n return tfd.MultivariateNormalDiag(\n loc=tf.tensordot(weights, x, axes=[[0], [-1]]))\n\n # Run 100 AIS chains in parallel\n num_chains = 100\n dims = 10\n dtype = np.float32\n\n # Make training data.\n x = np.random.randn(num_chains, dims).astype(dtype)\n true_weights = np.random.randn(dims).astype(dtype)\n y = np.dot(x, true_weights) + np.random.randn(num_chains)\n\n # Setup model.\n prior = make_prior(dims, dtype)\n def target_log_prob_fn(weights):\n return prior.log_prob(weights) + make_likelihood(weights, x).log_prob(y)\n\n proposal = tfd.MultivariateNormalDiag(\n loc=tf.zeros(dims, dtype))\n\n weight_samples, ais_weights, kernel_results = (\n tfp.mcmc.sample_annealed_importance_chain(\n num_steps=1000,\n proposal_log_prob_fn=proposal.log_prob,\n target_log_prob_fn=target_log_prob_fn,\n current_state=tf.zeros([num_chains, dims], dtype),\n make_kernel_fn=lambda tlp_fn: tfp.mcmc.HamiltonianMonteCarlo(\n target_log_prob_fn=tlp_fn,\n step_size=0.1,\n num_leapfrog_steps=2)))\n log_normalizer_estimate = (tf.reduce_logsumexp(ais_weights)\n - np.log(num_chains))\n ```\n\n \"\"\"\n with tf.compat.v1.name_scope(name, \"sample_annealed_importance_chain\",\n [num_steps, current_state]):\n num_steps = tf.convert_to_tensor(\n value=num_steps, dtype=tf.int32, name=\"num_steps\")\n if mcmc_util.is_list_like(current_state):\n current_state = [\n tf.convert_to_tensor(value=s, name=\"current_state\")\n for s in current_state\n ]\n else:\n current_state = tf.convert_to_tensor(\n value=current_state, name=\"current_state\")\n\n def _make_convex_combined_log_prob_fn(iter_):\n def _fn(*args):\n p = tf.identity(proposal_log_prob_fn(*args), name=\"proposal_log_prob\")\n t = tf.identity(target_log_prob_fn(*args), 
name=\"target_log_prob\")\n dtype = p.dtype.base_dtype\n beta = tf.cast(iter_ + 1, dtype) / tf.cast(num_steps, dtype)\n return tf.identity(beta * t + (1. - beta) * p,\n name=\"convex_combined_log_prob\")\n return _fn\n\n def _loop_body(iter_, ais_weights, current_state, kernel_results):\n \"\"\"Closure which implements `tf.while_loop` body.\"\"\"\n x = (current_state if mcmc_util.is_list_like(current_state)\n else [current_state])\n proposal_log_prob = proposal_log_prob_fn(*x)\n target_log_prob = target_log_prob_fn(*x)\n ais_weights += ((target_log_prob - proposal_log_prob) /\n tf.cast(num_steps, ais_weights.dtype))\n kernel = make_kernel_fn(_make_convex_combined_log_prob_fn(iter_))\n next_state, inner_results = kernel.one_step(\n current_state, kernel_results.inner_results)\n kernel_results = AISResults(\n proposal_log_prob=proposal_log_prob,\n target_log_prob=target_log_prob,\n inner_results=inner_results,\n )\n return [iter_ + 1, ais_weights, next_state, kernel_results]\n\n def _bootstrap_results(init_state):\n \"\"\"Creates first version of `previous_kernel_results`.\"\"\"\n kernel = make_kernel_fn(_make_convex_combined_log_prob_fn(iter_=0))\n inner_results = kernel.bootstrap_results(init_state)\n\n convex_combined_log_prob = inner_results.accepted_results.target_log_prob\n dtype = convex_combined_log_prob.dtype.as_numpy_dtype\n shape = tf.shape(input=convex_combined_log_prob)\n proposal_log_prob = tf.fill(shape, dtype(np.nan),\n name=\"bootstrap_proposal_log_prob\")\n target_log_prob = tf.fill(shape, dtype(np.nan),\n name=\"target_target_log_prob\")\n\n return AISResults(\n proposal_log_prob=proposal_log_prob,\n target_log_prob=target_log_prob,\n inner_results=inner_results,\n )\n\n previous_kernel_results = _bootstrap_results(current_state)\n inner_results = previous_kernel_results.inner_results\n\n ais_weights = tf.zeros(\n shape=tf.broadcast_dynamic_shape(\n tf.shape(input=inner_results.proposed_results.target_log_prob),\n 
tf.shape(input=inner_results.accepted_results.target_log_prob)),\n dtype=inner_results.proposed_results.target_log_prob.dtype.base_dtype)\n\n [_, ais_weights, current_state, kernel_results] = tf.while_loop(\n cond=lambda iter_, *args: iter_ < num_steps,\n body=_loop_body,\n loop_vars=[\n np.int32(0), # iter_\n ais_weights,\n current_state,\n previous_kernel_results,\n ],\n parallel_iterations=parallel_iterations)\n\n return [current_state, ais_weights, kernel_results]", "response": "Runs annealed importance sampling (AIS) from proposal_log_prob_fn toward target_log_prob_fn starting at current_state, returning the final chain state, the accumulated log importance weights (whose exponential is an unbiased estimate of the ratio of the two normalizing constants), and the kernel results."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\ncreate a value-setting interceptor for Edward2 random variables", "response": "def make_value_setter(**model_kwargs):\n \"\"\"Creates a value-setting interceptor.\n\n This function creates an interceptor that sets values of Edward2 random\n variable objects. This is useful for a range of tasks, including conditioning\n on observed data, sampling from posterior predictive distributions, and as a\n building block of inference primitives such as computing log joint\n probabilities (see examples below).\n\n Args:\n **model_kwargs: dict of str to Tensor. Keys are the names of random\n variables in the model to which this interceptor is being applied. Values\n are Tensors to set their value to. 
Variables not included in this dict\n will not be set and will maintain their existing value semantics (by\n default, a sample from the parent-conditional distribution).\n\n Returns:\n set_values: function that sets the value of intercepted ops.\n\n #### Examples\n\n Consider for illustration a model with latent `z` and\n observed `x`, and a corresponding trainable posterior model:\n\n ```python\n num_observations = 10\n def model():\n z = ed.Normal(loc=0, scale=1., name='z') # log rate\n x = ed.Poisson(rate=tf.exp(z) * tf.ones(num_observations), name='x')\n return x\n\n def variational_model():\n return ed.Normal(loc=tf.Variable(0.),\n scale=tf.nn.softplus(tf.Variable(-4.)),\n name='z') # for simplicity, match name of the model RV.\n ```\n\n We can use a value-setting interceptor to condition the model on observed\n data. This approach is slightly more cumbersome than that of partially\n evaluating the complete log-joint function, but has the potential advantage\n that it returns a new model callable, which may be used to sample downstream\n variables, passed into additional transformations, etc.\n\n ```python\n x_observed = np.array([6, 3, 1, 8, 7, 0, 6, 4, 7, 5])\n def observed_model():\n with ed.interception(make_value_setter(x=x_observed)):\n model()\n observed_log_joint_fn = ed.make_log_joint_fn(observed_model)\n\n # After fixing 'x', the observed log joint is now only a function of 'z'.\n # This enables us to define a variational lower bound,\n # `E_q[ log p(x, z) - log q(z)]`, simply by evaluating the observed and\n # variational log joints at variational samples.\n variational_log_joint_fn = ed.make_log_joint_fn(variational_model)\n with ed.tape() as variational_sample: # Sample trace from variational model.\n variational_model()\n elbo_loss = -(observed_log_joint_fn(**variational_sample) -\n variational_log_joint_fn(**variational_sample))\n ```\n\n After performing inference by minimizing the variational loss, a value-setting\n interceptor enables 
simulation from the posterior predictive distribution:\n\n ```python\n with ed.tape() as posterior_samples: # tape is a map {rv.name : rv}\n variational_model()\n with ed.interception(ed.make_value_setter(**posterior_samples)):\n x = model()\n # x is a sample from p(X | Z = z') where z' ~ q(z) (the variational model)\n ```\n\n As another example, using a value setter inside of `ed.tape` enables\n computing the log joint probability, by setting all variables to\n posterior values and then accumulating the log probs of those values under\n the induced parent-conditional distributions. This is one way that we could\n have implemented `ed.make_log_joint_fn`:\n\n ```python\n def make_log_joint_fn_demo(model):\n def log_joint_fn(**model_kwargs):\n with ed.tape() as model_tape:\n with ed.interception(ed.make_value_setter(**model_kwargs)):\n model()\n\n # accumulate sum_i log p(X_i = x_i | X_{:i-1} = x_{:i-1})\n log_prob = 0.\n for rv in model_tape.values():\n log_prob += tf.reduce_sum(rv.log_prob(rv.value))\n\n return log_prob\n return log_joint_fn\n ```\n\n \"\"\"\n def set_values(f, *args, **kwargs):\n \"\"\"Sets each random variable's value to its aligned value.\"\"\"\n name = kwargs.get(\"name\")\n if name in model_kwargs:\n kwargs[\"value\"] = model_kwargs[name]\n return interceptable(f)(*args, **kwargs)\n return set_values"} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\ntaking an Edward probabilistic program and returning its log-joint function.", "response": "def make_log_joint_fn(model):\n \"\"\"Takes Edward probabilistic program and returns its log joint function.\n\n Args:\n model: Python callable which executes the generative process of a\n computable probability distribution using `ed.RandomVariable`s.\n\n Returns:\n A log-joint probability function. Its inputs are `model`'s original inputs\n and random variables which appear during the program execution. 
Its output\n is a scalar tf.Tensor.\n\n #### Examples\n\n Below we define Bayesian logistic regression as an Edward program,\n representing the model's generative process. We apply `make_log_joint_fn` in\n order to represent the model in terms of its joint probability function.\n\n ```python\n from tensorflow_probability import edward2 as ed\n\n def logistic_regression(features):\n coeffs = ed.Normal(loc=0., scale=1.,\n sample_shape=features.shape[1], name=\"coeffs\")\n outcomes = ed.Bernoulli(logits=tf.tensordot(features, coeffs, [[1], [0]]),\n name=\"outcomes\")\n return outcomes\n\n log_joint = ed.make_log_joint_fn(logistic_regression)\n\n features = tf.random_normal([3, 2])\n coeffs_value = tf.random_normal([2])\n outcomes_value = tf.round(tf.random_uniform([3]))\n output = log_joint(features, coeffs=coeffs_value, outcomes=outcomes_value)\n ```\n\n \"\"\"\n def log_joint_fn(*args, **kwargs):\n \"\"\"Log-probability of inputs according to a joint probability distribution.\n\n Args:\n *args: Positional arguments. 
They are the model's original inputs and can\n alternatively be specified as part of `kwargs`.\n **kwargs: Keyword arguments, where for each key-value pair `k` and `v`,\n `v` is passed as a `value` to the random variable(s) whose keyword\n argument `name` during construction is equal to `k`.\n\n Returns:\n Scalar tf.Tensor, which represents the model's log-probability summed\n over all Edward random variables and their dimensions.\n\n Raises:\n TypeError: If a random variable in the model has no specified value in\n `**kwargs`.\n \"\"\"\n log_probs = []\n\n def interceptor(rv_constructor, *rv_args, **rv_kwargs):\n \"\"\"Overrides a random variable's `value` and accumulates its log-prob.\"\"\"\n # Set value to keyword argument indexed by `name` (an input tensor).\n rv_name = rv_kwargs.get(\"name\")\n if rv_name is None:\n raise KeyError(\"Random variable constructor {} has no name \"\n \"in its arguments.\".format(rv_constructor.__name__))\n\n # If no value is explicitly passed in for an RV, default to the value\n # from the RV constructor. 
This may have been set explicitly by the user\n # or forwarded from a lower-level interceptor.\n previously_specified_value = rv_kwargs.get(\"value\")\n value = kwargs.get(rv_name, previously_specified_value)\n if value is None:\n raise LookupError(\"Keyword argument specifying value for {} is \"\n \"missing.\".format(rv_name))\n rv_kwargs[\"value\"] = value\n\n rv = rv_constructor(*rv_args, **rv_kwargs)\n log_prob = tf.reduce_sum(input_tensor=rv.distribution.log_prob(rv.value))\n log_probs.append(log_prob)\n return rv\n\n model_kwargs = _get_function_inputs(model, kwargs)\n with interception(interceptor):\n model(*args, **model_kwargs)\n log_prob = sum(log_probs)\n return log_prob\n return log_joint_fn"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _get_function_inputs(f, src_kwargs):\n if hasattr(f, \"_func\"): # functions returned by tf.make_template\n f = f._func # pylint: disable=protected-access\n\n try: # getargspec was deprecated in Python 3.6\n argspec = inspect.getfullargspec(f)\n except AttributeError:\n argspec = inspect.getargspec(f)\n\n fkwargs = {k: v for k, v in six.iteritems(src_kwargs) if k in argspec.args}\n return fkwargs", "response": "Filters inputs to be compatible with function f s signature."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\nnetworking block for VGG.", "response": "def _vggconv_block(x, filters, kernel, stride, kernel_posterior_fn):\n \"\"\"Network block for VGG.\"\"\"\n out = tfp.layers.Convolution2DFlipout(\n filters,\n kernel,\n padding='same',\n kernel_posterior_fn=kernel_posterior_fn)(x)\n out = tf.keras.layers.BatchNormalization()(out)\n out = tf.keras.layers.Activation('relu')(out)\n\n out = tfp.layers.Convolution2DFlipout(\n filters,\n kernel,\n padding='same',\n kernel_posterior_fn=kernel_posterior_fn)(out)\n out = tf.keras.layers.BatchNormalization()(out)\n out = tf.keras.layers.Activation('relu')(out)\n\n out = 
tf.keras.layers.MaxPooling2D(\n pool_size=(2, 2), strides=stride)(out)\n return out"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef compute_output_shape(self, input_shape):\n input_shape = tf.TensorShape(input_shape)\n input_shape = input_shape.with_rank_at_least(2)\n if tf.compat.dimension_value(input_shape[-1]) is None:\n raise ValueError(\n 'The innermost dimension of `input_shape` must be defined, '\n 'but saw: {}'.format(input_shape))\n return input_shape[:-1].concatenate(self.units)", "response": "Computes the output shape of the layer."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef get_config(self):\n config = {\n 'units': self.units,\n 'activation': (tf.keras.activations.serialize(self.activation)\n if self.activation else None),\n 'activity_regularizer':\n tf.keras.initializers.serialize(self.activity_regularizer),\n }\n function_keys = [\n 'kernel_posterior_fn',\n 'kernel_posterior_tensor_fn',\n 'kernel_prior_fn',\n 'kernel_divergence_fn',\n 'bias_posterior_fn',\n 'bias_posterior_tensor_fn',\n 'bias_prior_fn',\n 'bias_divergence_fn',\n ]\n for function_key in function_keys:\n function = getattr(self, function_key)\n if function is None:\n function_name = None\n function_type = None\n else:\n function_name, function_type = tfp_layers_util.serialize_function(\n function)\n config[function_key] = function_name\n config[function_key + '_type'] = function_type\n base_config = super(_DenseVariational, self).get_config()\n return dict(list(base_config.items()) + list(config.items()))", "response": "Returns the config of the current layer."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\nsimulate a No - U - Turn Sampler.", "response": "def kernel(target_log_prob_fn,\n current_state,\n step_size,\n seed=None,\n current_target_log_prob=None,\n current_grads_target_log_prob=None,\n name=None):\n 
\"\"\"Simulates a No-U-Turn Sampler (NUTS) trajectory.\n\n Args:\n target_log_prob_fn: Python callable which takes an argument like\n `*current_state` and returns its (possibly unnormalized) log-density under\n the target distribution.\n current_state: List of `Tensor`s representing the states to simulate from.\n step_size: List of `Tensor`s representing the step sizes for the leapfrog\n integrator. Must have same shape as `current_state`.\n seed: Integer to seed the random number generator.\n current_target_log_prob: Scalar `Tensor` representing the value of\n `target_log_prob_fn` at the `current_state`.\n current_grads_target_log_prob: List of `Tensor`s representing gradient of\n `current_target_log_prob` with respect to `current_state`. Must have same\n shape as `current_state`.\n name: A name for the operation.\n\n Returns:\n next_state: List of `Tensor`s representing the next states of the NUTS\n trajectory. Has same shape as `current_state`.\n next_target_log_prob: Scalar `Tensor` representing the value of\n `target_log_prob_fn` at `next_state`.\n next_grads_target_log_prob: List of `Tensor`s representing the gradient of\n `next_target_log_prob` with respect to `next_state`.\n\n Raises:\n NotImplementedError: If the execution mode is not eager.\n \"\"\"\n if not tf.executing_eagerly():\n raise NotImplementedError(\"`kernel` is only available in Eager mode.\")\n\n with tf.compat.v1.name_scope(\n name,\n default_name=\"nuts_kernel\",\n values=[\n current_state, step_size, seed, current_target_log_prob,\n current_grads_target_log_prob\n ]):\n with tf.compat.v1.name_scope(\"initialize\"):\n current_state = [tf.convert_to_tensor(value=s) for s in current_state]\n step_size = [tf.convert_to_tensor(value=s) for s in step_size]\n value_and_gradients_fn = lambda *args: tfp.math.value_and_gradient( # pylint: disable=g-long-lambda\n target_log_prob_fn, args)\n value_and_gradients_fn = _embed_no_none_gradient_check(\n value_and_gradients_fn)\n if (current_target_log_prob 
is None or\n current_grads_target_log_prob is None):\n (current_target_log_prob,\n current_grads_target_log_prob) = value_and_gradients_fn(*current_state)\n\n seed_stream = tfd.SeedStream(seed, \"nuts_kernel\")\n current_momentum = []\n for state_tensor in current_state:\n momentum_tensor = tf.random.normal(\n shape=tf.shape(input=state_tensor),\n dtype=state_tensor.dtype,\n seed=seed_stream())\n current_momentum.append(momentum_tensor)\n\n # Draw a slice variable u ~ Uniform(0, p(initial state, initial\n # momentum)) and compute log u. For numerical stability, we perform this\n # in log space where log u = log (u' * p(...)) = log u' + log\n # p(...) and u' ~ Uniform(0, 1).\n log_slice_sample = tf.math.log(tf.random.uniform([], seed=seed_stream()))\n log_slice_sample += _log_joint(current_target_log_prob,\n current_momentum)\n\n # Initialize loop variables. It comprises a collection of information\n # about a \"reverse\" state, a collection of information about a \"forward\"\n # state, a collection of information about the next state,\n # the trajectory's tree depth, the number of candidate states, and\n # whether to continue the trajectory.\n reverse_state = current_state\n reverse_target_log_prob = current_target_log_prob\n reverse_grads_target_log_prob = current_grads_target_log_prob\n reverse_momentum = current_momentum\n forward_state = current_state\n forward_target_log_prob = current_target_log_prob\n forward_grads_target_log_prob = current_grads_target_log_prob\n forward_momentum = current_momentum\n next_state = current_state\n next_target_log_prob = current_target_log_prob\n next_grads_target_log_prob = current_grads_target_log_prob\n depth = 0\n num_states = 1\n continue_trajectory = True\n\n while continue_trajectory:\n # Grow the No-U-Turn Sampler trajectory by choosing a random direction and\n # simulating Hamiltonian dynamics in that direction. 
This extends either\n # the forward or reverse state.\n direction = tfp.math.random_rademacher([], seed=seed_stream())\n if direction < 0:\n [\n reverse_state,\n reverse_target_log_prob,\n reverse_grads_target_log_prob,\n reverse_momentum,\n _,\n _,\n _,\n _,\n next_state_in_subtree,\n next_target_log_prob_in_subtree,\n next_grads_target_log_prob_in_subtree,\n num_states_in_subtree,\n continue_trajectory,\n ] = _build_tree(\n value_and_gradients_fn=value_and_gradients_fn,\n current_state=reverse_state,\n current_target_log_prob=reverse_target_log_prob,\n current_grads_target_log_prob=reverse_grads_target_log_prob,\n current_momentum=reverse_momentum,\n direction=direction,\n depth=depth,\n step_size=step_size,\n log_slice_sample=log_slice_sample,\n seed=seed_stream())\n else:\n [\n _,\n _,\n _,\n _,\n forward_state,\n forward_target_log_prob,\n forward_grads_target_log_prob,\n forward_momentum,\n next_state_in_subtree,\n next_target_log_prob_in_subtree,\n next_grads_target_log_prob_in_subtree,\n num_states_in_subtree,\n continue_trajectory,\n ] = _build_tree(\n value_and_gradients_fn=value_and_gradients_fn,\n current_state=forward_state,\n current_target_log_prob=forward_target_log_prob,\n current_grads_target_log_prob=forward_grads_target_log_prob,\n current_momentum=forward_momentum,\n direction=direction,\n depth=depth,\n step_size=step_size,\n log_slice_sample=log_slice_sample,\n seed=seed_stream())\n\n if continue_trajectory:\n # If the built tree did not terminate, accept the tree's next state\n # with a certain probability.\n accept_state_in_subtree = _random_bernoulli(\n [],\n probs=tf.minimum(1., num_states_in_subtree / num_states),\n dtype=tf.bool,\n seed=seed_stream())\n if accept_state_in_subtree:\n next_state = next_state_in_subtree\n next_target_log_prob = next_target_log_prob_in_subtree\n next_grads_target_log_prob = next_grads_target_log_prob_in_subtree\n\n # Continue the NUTS trajectory if the tree-building did not terminate, and\n # if the 
reverse-most and forward-most states do not exhibit a U-turn.\n has_no_u_turn = tf.logical_and(\n _has_no_u_turn(forward_state, reverse_state, forward_momentum),\n _has_no_u_turn(forward_state, reverse_state, reverse_momentum))\n continue_trajectory = continue_trajectory and has_no_u_turn\n num_states += num_states_in_subtree\n depth += 1\n\n return next_state, next_target_log_prob, next_grads_target_log_prob"} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\nbuild a tree at a given depth.", "response": "def _build_tree(value_and_gradients_fn,\n current_state,\n current_target_log_prob,\n current_grads_target_log_prob,\n current_momentum,\n direction,\n depth,\n step_size,\n log_slice_sample,\n max_simulation_error=1000.,\n seed=None):\n \"\"\"Builds a tree at a given tree depth and at a given state.\n\n The `current` state is immediately adjacent to, but outside of,\n the subtrajectory spanned by the returned `forward` and `reverse` states.\n\n Args:\n value_and_gradients_fn: Python callable which takes an argument like\n `*current_state` and returns a tuple of its (possibly unnormalized)\n log-density under the target distribution and its gradient with respect to\n each state.\n current_state: List of `Tensor`s representing the current states of the\n NUTS trajectory.\n current_target_log_prob: Scalar `Tensor` representing the value of\n `target_log_prob_fn` at the `current_state`.\n current_grads_target_log_prob: List of `Tensor`s representing gradient of\n `current_target_log_prob` with respect to `current_state`. Must have same\n shape as `current_state`.\n current_momentum: List of `Tensor`s representing the momentums of\n `current_state`. Must have same shape as `current_state`.\n direction: int that is either -1 or 1. 
It determines whether to perform\n leapfrog integration backwards (reverse) or forward in time respectively.\n depth: non-negative int that indicates how deep of a tree to build.\n Each call to `_build_tree` takes `2**depth` leapfrog steps.\n step_size: List of `Tensor`s representing the step sizes for the leapfrog\n integrator. Must have same shape as `current_state`.\n log_slice_sample: The log of an auxiliary slice variable. It is used\n together with `max_simulation_error` to avoid simulating trajectories with\n too much numerical error.\n max_simulation_error: Maximum simulation error to tolerate before\n terminating the trajectory. Simulation error is the\n `log_slice_sample` minus the log-joint probability at the simulated state.\n seed: Integer to seed the random number generator.\n\n Returns:\n reverse_state: List of `Tensor`s representing the \"reverse\" states of the\n NUTS trajectory. Has same shape as `current_state`.\n reverse_target_log_prob: Scalar `Tensor` representing the value of\n `target_log_prob_fn` at the `reverse_state`.\n reverse_grads_target_log_prob: List of `Tensor`s representing gradient of\n `reverse_target_log_prob` with respect to `reverse_state`. Has same shape\n as `reverse_state`.\n reverse_momentum: List of `Tensor`s representing the momentums of\n `reverse_state`. Has same shape as `reverse_state`.\n forward_state: List of `Tensor`s representing the \"forward\" states of the\n NUTS trajectory. Has same shape as `current_state`.\n forward_target_log_prob: Scalar `Tensor` representing the value of\n `target_log_prob_fn` at the `forward_state`.\n forward_grads_target_log_prob: List of `Tensor`s representing gradient of\n `forward_target_log_prob` with respect to `forward_state`. Has same shape\n as `forward_state`.\n forward_momentum: List of `Tensor`s representing the momentums of\n `forward_state`. Has same shape as `forward_state`.\n next_state: List of `Tensor`s representing the next states of the NUTS\n trajectory. 
Has same shape as `current_state`.\n next_target_log_prob: Scalar `Tensor` representing the value of\n `target_log_prob_fn` at `next_state`.\n next_grads_target_log_prob: List of `Tensor`s representing the gradient of\n `next_target_log_prob` with respect to `next_state`.\n num_states: Number of acceptable candidate states in the subtree. A state is\n acceptable if it is \"in the slice\", that is, if its log-joint probability\n with its momentum is greater than `log_slice_sample`.\n continue_trajectory: bool determining whether to continue the simulation\n trajectory. The trajectory is continued if no U-turns are encountered\n within the built subtree, and if the log-probability accumulation due to\n integration error does not exceed `max_simulation_error`.\n \"\"\"\n if depth == 0: # base case\n # Take a leapfrog step. Terminate the tree-building if the simulation\n # error from the leapfrog integrator is too large. States discovered by\n # continuing the simulation are likely to have very low probability.\n [\n next_state,\n next_target_log_prob,\n next_grads_target_log_prob,\n next_momentum,\n ] = _leapfrog(\n value_and_gradients_fn=value_and_gradients_fn,\n current_state=current_state,\n current_grads_target_log_prob=current_grads_target_log_prob,\n current_momentum=current_momentum,\n step_size=direction * step_size)\n next_log_joint = _log_joint(next_target_log_prob, next_momentum)\n num_states = tf.cast(next_log_joint > log_slice_sample, dtype=tf.int32)\n continue_trajectory = (next_log_joint >\n log_slice_sample - max_simulation_error)\n return [\n next_state,\n next_target_log_prob,\n next_grads_target_log_prob,\n next_momentum,\n next_state,\n next_target_log_prob,\n next_grads_target_log_prob,\n next_momentum,\n next_state,\n next_target_log_prob,\n next_grads_target_log_prob,\n num_states,\n continue_trajectory,\n ]\n\n # Build a tree at the current state.\n seed_stream = tfd.SeedStream(seed, \"build_tree\")\n [\n reverse_state,\n 
reverse_target_log_prob,\n reverse_grads_target_log_prob,\n reverse_momentum,\n forward_state,\n forward_target_log_prob,\n forward_grads_target_log_prob,\n forward_momentum,\n next_state,\n next_target_log_prob,\n next_grads_target_log_prob,\n num_states,\n continue_trajectory,\n ] = _build_tree(value_and_gradients_fn=value_and_gradients_fn,\n current_state=current_state,\n current_target_log_prob=current_target_log_prob,\n current_grads_target_log_prob=current_grads_target_log_prob,\n current_momentum=current_momentum,\n direction=direction,\n depth=depth - 1,\n step_size=step_size,\n log_slice_sample=log_slice_sample,\n seed=seed_stream())\n if continue_trajectory:\n # If the just-built subtree did not terminate, build a second subtree at\n # the forward or reverse state, as appropriate.\n if direction < 0:\n [\n reverse_state,\n reverse_target_log_prob,\n reverse_grads_target_log_prob,\n reverse_momentum,\n _,\n _,\n _,\n _,\n far_state,\n far_target_log_prob,\n far_grads_target_log_prob,\n far_num_states,\n far_continue_trajectory,\n ] = _build_tree(\n value_and_gradients_fn=value_and_gradients_fn,\n current_state=reverse_state,\n current_target_log_prob=reverse_target_log_prob,\n current_grads_target_log_prob=reverse_grads_target_log_prob,\n current_momentum=reverse_momentum,\n direction=direction,\n depth=depth - 1,\n step_size=step_size,\n log_slice_sample=log_slice_sample,\n seed=seed_stream())\n else:\n [\n _,\n _,\n _,\n _,\n forward_state,\n forward_target_log_prob,\n forward_grads_target_log_prob,\n forward_momentum,\n far_state,\n far_target_log_prob,\n far_grads_target_log_prob,\n far_num_states,\n far_continue_trajectory,\n ] = _build_tree(\n value_and_gradients_fn=value_and_gradients_fn,\n current_state=forward_state,\n current_target_log_prob=forward_target_log_prob,\n current_grads_target_log_prob=forward_grads_target_log_prob,\n current_momentum=forward_momentum,\n direction=direction,\n depth=depth - 1,\n step_size=step_size,\n 
log_slice_sample=log_slice_sample,\n seed=seed_stream())\n\n # Propose either `next_state` (which came from the first subtree and so is\n # nearby) or the new forward/reverse state (which came from the second\n # subtree and so is far away).\n num_states += far_num_states\n accept_far_state = _random_bernoulli(\n [],\n probs=far_num_states / num_states,\n dtype=tf.bool,\n seed=seed_stream())\n if accept_far_state:\n next_state = far_state\n next_target_log_prob = far_target_log_prob\n next_grads_target_log_prob = far_grads_target_log_prob\n\n # Continue the NUTS trajectory if the far subtree did not terminate either,\n # and if the reverse-most and forward-most states do not exhibit a U-turn.\n has_no_u_turn = tf.logical_and(\n _has_no_u_turn(forward_state, reverse_state, forward_momentum),\n _has_no_u_turn(forward_state, reverse_state, reverse_momentum))\n continue_trajectory = far_continue_trajectory and has_no_u_turn\n\n return [\n reverse_state,\n reverse_target_log_prob,\n reverse_grads_target_log_prob,\n reverse_momentum,\n forward_state,\n forward_target_log_prob,\n forward_grads_target_log_prob,\n forward_momentum,\n next_state,\n next_target_log_prob,\n next_grads_target_log_prob,\n num_states,\n continue_trajectory,\n ]"} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\nwrap value and gradients function to assist with None gradients.", "response": "def _embed_no_none_gradient_check(value_and_gradients_fn):\n \"\"\"Wraps value and gradients function to assist with None gradients.\"\"\"\n @functools.wraps(value_and_gradients_fn)\n def func_wrapped(*args, **kwargs):\n \"\"\"Wrapped function which checks for None gradients.\"\"\"\n value, grads = value_and_gradients_fn(*args, **kwargs)\n if any(grad is None for grad in grads):\n raise ValueError(\"Gradient is None for a state.\")\n return value, grads\n return func_wrapped"} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 
that\nreturns True if two given states and momentum do not exhibit a U-turn pattern.", "response": "def _has_no_u_turn(state_one, state_two, momentum):\n \"\"\"If two given states and momentum do not exhibit a U-turn pattern.\"\"\"\n dot_product = sum([\n tf.reduce_sum(input_tensor=(s1 - s2) * m)\n for s1, s2, m in zip(state_one, state_two, momentum)\n ])\n return dot_product > 0"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\nrunning one step of leapfrog integration.", "response": "def _leapfrog(value_and_gradients_fn,\n current_state,\n current_grads_target_log_prob,\n current_momentum,\n step_size):\n \"\"\"Runs one step of leapfrog integration.\"\"\"\n mid_momentum = [\n m + 0.5 * step * g for m, step, g in\n zip(current_momentum, step_size, current_grads_target_log_prob)]\n next_state = [\n s + step * m for s, step, m in\n zip(current_state, step_size, mid_momentum)]\n next_target_log_prob, next_grads_target_log_prob = value_and_gradients_fn(\n *next_state)\n next_momentum = [\n m + 0.5 * step * g for m, step, g in\n zip(mid_momentum, step_size, next_grads_target_log_prob)]\n return [\n next_state,\n next_target_log_prob,\n next_grads_target_log_prob,\n next_momentum,\n ]"} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\ncompute the log-joint probability given a state's log-probability and momentum.", "response": "def _log_joint(current_target_log_prob, current_momentum):\n \"\"\"Log-joint probability given a state's log-probability and momentum.\"\"\"\n momentum_log_prob = -sum(\n [tf.reduce_sum(input_tensor=0.5 * (m**2.)) for m in current_momentum])\n return current_target_log_prob + momentum_log_prob"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\nreturn samples from a Bernoulli distribution.", "response": "def _random_bernoulli(shape, probs, dtype=tf.int32, seed=None, name=None):\n \"\"\"Returns samples from a Bernoulli distribution.\"\"\"\n with
tf.compat.v1.name_scope(name, \"random_bernoulli\", [shape, probs]):\n probs = tf.convert_to_tensor(value=probs)\n random_uniform = tf.random.uniform(shape, dtype=probs.dtype, seed=seed)\n return tf.cast(tf.less(random_uniform, probs), dtype)"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef default_loc_scale_fn(\n is_singular=False,\n loc_initializer=tf.compat.v1.initializers.random_normal(stddev=0.1),\n untransformed_scale_initializer=tf.compat.v1.initializers.random_normal(\n mean=-3., stddev=0.1),\n loc_regularizer=None,\n untransformed_scale_regularizer=None,\n loc_constraint=None,\n untransformed_scale_constraint=None):\n \"\"\"Makes closure which creates `loc`, `scale` params from `tf.get_variable`.\n\n This function produces a closure which produces `loc`, `scale` using\n `tf.get_variable`. The closure accepts the following arguments:\n\n dtype: Type of parameter's event.\n shape: Python `list`-like representing the parameter's event shape.\n name: Python `str` name prepended to any created (or existing)\n `tf.Variable`s.\n trainable: Python `bool` indicating all created `tf.Variable`s should be\n added to the graph collection `GraphKeys.TRAINABLE_VARIABLES`.\n add_variable_fn: `tf.get_variable`-like `callable` used to create (or\n access existing) `tf.Variable`s.\n\n Args:\n is_singular: Python `bool` indicating if `scale is None`. Default: `False`.\n loc_initializer: Initializer function for the `loc` parameters.\n The default is `tf.random_normal_initializer(mean=0., stddev=0.1)`.\n untransformed_scale_initializer: Initializer function for the `scale`\n parameters. Default value: `tf.random_normal_initializer(mean=-3.,\n stddev=0.1)`. This implies the softplus transformed result is initialized\n near `0`. 
It allows a `Normal` distribution with `scale` parameter set to\n this value to approximately act like a point mass.\n loc_regularizer: Regularizer function for the `loc` parameters.\n The default (`None`) is to use the `tf.get_variable` default.\n untransformed_scale_regularizer: Regularizer function for the `scale`\n parameters. The default (`None`) is to use the `tf.get_variable` default.\n loc_constraint: An optional projection function to be applied to the\n loc after being updated by an `Optimizer`. The function must take as input\n the unprojected variable and must return the projected variable (which\n must have the same shape). Constraints are not safe to use when doing\n asynchronous distributed training.\n The default (`None`) is to use the `tf.get_variable` default.\n untransformed_scale_constraint: An optional projection function to be\n applied to the `scale` parameters after being updated by an `Optimizer`\n (e.g. used to implement norm constraints or value constraints). The\n function must take as input the unprojected variable and must return the\n projected variable (which must have the same shape). Constraints are not\n safe to use when doing asynchronous distributed training. 
The default\n (`None`) is to use the `tf.get_variable` default.\n\n Returns:\n default_loc_scale_fn: Python `callable` which instantiates `loc`, `scale`\n parameters from args: `dtype, shape, name, trainable, add_variable_fn`.\n \"\"\"\n def _fn(dtype, shape, name, trainable, add_variable_fn):\n \"\"\"Creates `loc`, `scale` parameters.\"\"\"\n loc = add_variable_fn(\n name=name + '_loc',\n shape=shape,\n initializer=loc_initializer,\n regularizer=loc_regularizer,\n constraint=loc_constraint,\n dtype=dtype,\n trainable=trainable)\n if is_singular:\n return loc, None\n untransformed_scale = add_variable_fn(\n name=name + '_untransformed_scale',\n shape=shape,\n initializer=untransformed_scale_initializer,\n regularizer=untransformed_scale_regularizer,\n constraint=untransformed_scale_constraint,\n dtype=dtype,\n trainable=trainable)\n scale = (np.finfo(dtype.as_numpy_dtype).eps +\n tf.nn.softplus(untransformed_scale))\n return loc, scale\n return _fn", "response": "Returns a closure which creates loc scale params from tf. 
get_variable."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef default_mean_field_normal_fn(\n is_singular=False,\n loc_initializer=tf.compat.v1.initializers.random_normal(stddev=0.1),\n untransformed_scale_initializer=tf.compat.v1.initializers.random_normal(\n mean=-3., stddev=0.1),\n loc_regularizer=None,\n untransformed_scale_regularizer=None,\n loc_constraint=None,\n untransformed_scale_constraint=None):\n \"\"\"Creates a function to build Normal distributions with trainable params.\n\n This function produces a closure which produces `tfd.Normal`\n parameterized by a loc` and `scale` each created using `tf.get_variable`.\n\n Args:\n is_singular: Python `bool` if `True`, forces the special case limit of\n `scale->0`, i.e., a `Deterministic` distribution.\n loc_initializer: Initializer function for the `loc` parameters.\n The default is `tf.random_normal_initializer(mean=0., stddev=0.1)`.\n untransformed_scale_initializer: Initializer function for the `scale`\n parameters. Default value: `tf.random_normal_initializer(mean=-3.,\n stddev=0.1)`. This implies the softplus transformed result is initialized\n near `0`. It allows a `Normal` distribution with `scale` parameter set to\n this value to approximately act like a point mass.\n loc_regularizer: Regularizer function for the `loc` parameters.\n untransformed_scale_regularizer: Regularizer function for the `scale`\n parameters.\n loc_constraint: An optional projection function to be applied to the\n loc after being updated by an `Optimizer`. The function must take as input\n the unprojected variable and must return the projected variable (which\n must have the same shape). Constraints are not safe to use when doing\n asynchronous distributed training.\n untransformed_scale_constraint: An optional projection function to be\n applied to the `scale` parameters after being updated by an `Optimizer`\n (e.g. used to implement norm constraints or value constraints). 
The\n function must take as input the unprojected variable and must return the\n projected variable (which must have the same shape). Constraints are not\n safe to use when doing asynchronous distributed training.\n\n Returns:\n make_normal_fn: Python `callable` which creates a `tfd.Normal`\n using from args: `dtype, shape, name, trainable, add_variable_fn`.\n \"\"\"\n loc_scale_fn = default_loc_scale_fn(\n is_singular=is_singular,\n loc_initializer=loc_initializer,\n untransformed_scale_initializer=untransformed_scale_initializer,\n loc_regularizer=loc_regularizer,\n untransformed_scale_regularizer=untransformed_scale_regularizer,\n loc_constraint=loc_constraint,\n untransformed_scale_constraint=untransformed_scale_constraint)\n def _fn(dtype, shape, name, trainable, add_variable_fn):\n \"\"\"Creates multivariate `Deterministic` or `Normal` distribution.\n\n Args:\n dtype: Type of parameter's event.\n shape: Python `list`-like representing the parameter's event shape.\n name: Python `str` name prepended to any created (or existing)\n `tf.Variable`s.\n trainable: Python `bool` indicating all created `tf.Variable`s should be\n added to the graph collection `GraphKeys.TRAINABLE_VARIABLES`.\n add_variable_fn: `tf.get_variable`-like `callable` used to create (or\n access existing) `tf.Variable`s.\n\n Returns:\n Multivariate `Deterministic` or `Normal` distribution.\n \"\"\"\n loc, scale = loc_scale_fn(dtype, shape, name, trainable, add_variable_fn)\n if scale is None:\n dist = tfd.Deterministic(loc=loc)\n else:\n dist = tfd.Normal(loc=loc, scale=scale)\n batch_ndims = tf.size(input=dist.batch_shape_tensor())\n return tfd.Independent(dist, reinterpreted_batch_ndims=batch_ndims)\n return _fn", "response": "Returns a function to build a normal distribution with trainable params."} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\ndefaults multivariate standard normal distribution.", "response": "def 
default_multivariate_normal_fn(dtype, shape, name, trainable,\n add_variable_fn):\n \"\"\"Creates multivariate standard `Normal` distribution.\n\n Args:\n dtype: Type of parameter's event.\n shape: Python `list`-like representing the parameter's event shape.\n name: Python `str` name prepended to any created (or existing)\n `tf.Variable`s.\n trainable: Python `bool` indicating all created `tf.Variable`s should be\n added to the graph collection `GraphKeys.TRAINABLE_VARIABLES`.\n add_variable_fn: `tf.get_variable`-like `callable` used to create (or\n access existing) `tf.Variable`s.\n\n Returns:\n Multivariate standard `Normal` distribution.\n \"\"\"\n del name, trainable, add_variable_fn # unused\n dist = tfd.Normal(loc=tf.zeros(shape, dtype), scale=dtype.as_numpy_dtype(1))\n batch_ndims = tf.size(input=dist.batch_shape_tensor())\n return tfd.Independent(dist, reinterpreted_batch_ndims=batch_ndims)"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef serialize_function(func):\n if isinstance(func, types.LambdaType):\n return generic_utils.func_dump(func), 'lambda'\n return func.__name__, 'function'", "response": "Serializes a Python function to be serialized by Keras."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef broadcast_structure(to_structure, from_structure):\n from_parts = tf.nest.flatten(from_structure)\n if len(from_parts) == 1:\n from_structure = tf.nest.map_structure(lambda _: from_parts[0],\n to_structure)\n return from_structure", "response": "Broadcasts from_structure to to_structure."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef expand_as_args(args):\n return (isinstance(args, collections.Sequence) and\n not _is_namedtuple(args) and not _force_leaf(args))", "response": "Returns True if args should be expanded as args."} {"SOURCE": "codesearchnet", "instruction": 
"Implement a function in Python 3 to\nconvert the given arguments to Tensors.", "response": "def convert_args_to_tensor(args, dtype=None, name=None):\n \"\"\"Converts `args` to `Tensor`s.\n\n Use this when it is necessary to convert user-provided arguments that will\n then be passed to user-provided callables.\n\n When `dtype` is `None` this function behaves as follows:\n\n 1A. If the top-level structure is a `list`/`tuple` but not a `namedtuple`,\n then it is left as is and only its elements are converted to `Tensor`s.\n\n 2A. The sub-structures are converted to `Tensor`s eagerly. E.g. if `args` is\n `{'arg': [[1], [2]]}` it is converted to\n `{'arg': tf.constant([[1], [2]])}`. If the conversion fails, it will\n attempt to recurse into its children.\n\n When `dtype` is specified, it acts as both a structural and numeric type\n constraint. `dtype` can be a single `DType`, `None` or a nested collection\n thereof. The conversion rule becomes as follows:\n\n 1B. The return value of this function will have the same structure as `dtype`.\n\n 2B. If the leaf of `dtype` is a concrete `DType`, then the corresponding\n sub-structure in `args` is converted to a `Tensor`.\n\n 3B. If the leaf of `dtype` is `None`, then the corresponding sub-structure is\n converted eagerly as described in the rule 2A above.\n\n Args:\n args: Arguments to convert to `Tensor`s.\n dtype: Optional structure/numeric type constraint.\n name: Optional name-scope to use.\n\n Returns:\n args: Converted `args`.\n\n #### Examples.\n\n This table shows some useful conversion cases.
`T` means `Tensor`, `NT` means\n `namedtuple` and `CNT` means a `namedtuple` with a `Tensor`-conversion\n function registered.\n\n | args | dtype | output |\n |:------------:|:----------:|:------------------:|\n | `{\"a\": 1}` | `None` | `{\"a\": T(1)}` |\n | `T(1)` | `None` | `T(1)` |\n | `[1]` | `None` | `[T(1)]` |\n | `[1]` | `tf.int32` | `T([1])` |\n | `[[T(1)]]` | `None` | `[T([1])]` |\n | `[[T(1)]]` | `[[None]]` | `[[T(1)]]` |\n | `NT(1, 2)` | `None` | `NT(T(1), T(2))` |\n | `NT(1, 2)` | `tf.int32` | `T([1, 2])` |\n | `CNT(1, 2)` | `None` | `T(...)` |\n | `[[1, [2]]]` | `None` | `[[T(1), T([2])]]` |\n\n \"\"\"\n if dtype is None:\n if expand_as_args(args) or _expand_as_kwargs(args):\n shallow_args = _get_shallow_structure(args)\n return nest.map_structure_up_to(\n shallow_args, lambda s: _nested_convert_to_tensor(s, name=name), args)\n else:\n return _nested_convert_to_tensor(args, name=name)\n else:\n return nest.map_structure_up_to(\n dtype, lambda s, dtype: _nested_convert_to_tensor(s, dtype, name), args,\n dtype)"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef call_fn(fn, args):\n\n if expand_as_args(args):\n return fn(*args)\n elif _expand_as_kwargs(args):\n return fn(**args)\n else:\n return fn(args)", "response": "Calls fn with args possibly expanding args."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _wrap_method(cls, attr):\n fn = getattr(cls, attr)\n is_property = isinstance(fn, property)\n if is_property:\n fn = fn.fget\n @functools.wraps(fn)\n def wrapped(self, *args, **kwargs):\n return fn(self._value(), *args, **kwargs) # pylint: disable=protected-access\n return property(wrapped) if is_property else wrapped", "response": "Wraps a Tensor method or property so that it is applied to the instance's concretized Tensor value."} {"SOURCE": "codesearchnet", "instruction": "How would you
explain what the following Python 3 function does\ndef _get_tensor_like_attributes():\n # Enable \"Tensor semantics\" for distributions.\n # See tensorflow/python/framework/ops.py `class Tensor` for details.\n attrs = dict()\n # Setup overloadable operators and white-listed members / properties.\n attrs.update((attr, _wrap_method(tf.Tensor, attr))\n for attr in tf.Tensor.OVERLOADABLE_OPERATORS.union({'__iter__'}))\n # Copy some members straight-through.\n attrs.update((attr, getattr(tf.Tensor, attr))\n for attr in {'__nonzero__', '__bool__', '__array_priority__'})\n return attrs", "response": "Returns a dictionary of Tensor attributes related to shape and Python builtins."} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\ngets the value of the current object in the cache.", "response": "def _value(self, dtype=None, name=None, as_ref=False): # pylint: disable=g-doc-args\n \"\"\"Get the value returned by `tf.convert_to_tensor(distribution)`.\n\n Note: this function may mutate the distribution instance state by caching\n the concretized `Tensor` value.\n\n Args:\n dtype: Must return a `Tensor` with the given `dtype` if specified.\n name: If the conversion function creates a new `Tensor`, it should use the\n given `name` if specified.\n as_ref: `as_ref` is true, the function must return a `Tensor` reference,\n such as a `Variable`.\n Returns:\n concretized_distribution_value: `Tensor` identical to\n `tf.convert_to_tensor(distribution)`.\n\n #### Examples\n\n ```python\n tfd = tfp.distributions\n x = tfd.Normal(0.5, 1).set_tensor_conversion(tfd.Distribution.mean)\n\n x._value()\n # ==> tf.convert_to_tensor(x) ==> 0.5\n\n x._value() + 2\n # ==> tf.convert_to_tensor(x) + 2. ==> 2.5\n\n x + 2\n # ==> tf.convert_to_tensor(x) + 2. 
==> 2.5\n ```\n\n \"\"\"\n # pylint: disable=protected-access\n if as_ref:\n raise NotImplementedError(\n 'Cannot convert a `Distribution` to a reference '\n '(e.g., `tf.Variable`).')\n if self._concrete_value is None:\n if self._convert_to_tensor_fn is None:\n raise NotImplementedError(\n 'Failed to convert object of type {} to Tensor. Contents: {}. '\n 'Call `distribution.set_tensor_conversion(lambda self: ...)` to '\n 'enable `tf.convert_to_tensor` capability. For example: '\n '`x = tfd.Normal(0,1).set_tensor_conversion(tfd.Distribution.mean)`'\n ' results in `tf.convert_to_tensor(x)` being identical to '\n '`x.mean()`.'.format(type(self), self))\n with self._name_scope('value'):\n self._concrete_value = (self._convert_to_tensor_fn(self)\n if callable(self._convert_to_tensor_fn)\n else self._convert_to_tensor_fn)\n if not tf.is_tensor(self._concrete_value):\n self._concrete_value = tfd._convert_to_tensor(\n value=self._concrete_value,\n name=name or 'concrete_value',\n dtype=dtype,\n dtype_hint=self.dtype)\n return self._concrete_value"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\ncreate the encoder function.", "response": "def make_encoder(activation, latent_size, base_depth):\n \"\"\"Creates the encoder function.\n\n Args:\n activation: Activation function in hidden layers.\n latent_size: The dimensionality of the encoding.\n base_depth: The lowest depth for a layer.\n\n Returns:\n encoder: A `callable` mapping a `Tensor` of images to a\n `tfd.Distribution` instance over encodings.\n \"\"\"\n conv = functools.partial(\n tf.keras.layers.Conv2D, padding=\"SAME\", activation=activation)\n\n encoder_net = tf.keras.Sequential([\n conv(base_depth, 5, 1),\n conv(base_depth, 5, 2),\n conv(2 * base_depth, 5, 1),\n conv(2 * base_depth, 5, 2),\n conv(4 * latent_size, 7, padding=\"VALID\"),\n tf.keras.layers.Flatten(),\n tf.keras.layers.Dense(2 * latent_size, activation=None),\n ])\n\n def encoder(images):\n images = 2 * tf.cast(images, 
dtype=tf.float32) - 1\n net = encoder_net(images)\n return tfd.MultivariateNormalDiag(\n loc=net[..., :latent_size],\n scale_diag=tf.nn.softplus(net[..., latent_size:] +\n _softplus_inverse(1.0)),\n name=\"code\")\n\n return encoder"} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef make_decoder(activation, latent_size, output_shape, base_depth):\n deconv = functools.partial(\n tf.keras.layers.Conv2DTranspose, padding=\"SAME\", activation=activation)\n conv = functools.partial(\n tf.keras.layers.Conv2D, padding=\"SAME\", activation=activation)\n\n decoder_net = tf.keras.Sequential([\n deconv(2 * base_depth, 7, padding=\"VALID\"),\n deconv(2 * base_depth, 5),\n deconv(2 * base_depth, 5, 2),\n deconv(base_depth, 5),\n deconv(base_depth, 5, 2),\n deconv(base_depth, 5),\n conv(output_shape[-1], 5, activation=None),\n ])\n\n def decoder(codes):\n original_shape = tf.shape(input=codes)\n # Collapse the sample and batch dimension and convert to rank-4 tensor for\n # use with a convolutional decoder network.\n codes = tf.reshape(codes, (-1, 1, 1, latent_size))\n logits = decoder_net(codes)\n logits = tf.reshape(\n logits, shape=tf.concat([original_shape[:-1], output_shape], axis=0))\n return tfd.Independent(tfd.Bernoulli(logits=logits),\n reinterpreted_batch_ndims=len(output_shape),\n name=\"image\")\n\n return decoder", "response": "Creates the decoder function."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\ncreate the mixture of Gaussians prior distribution.", "response": "def make_mixture_prior(latent_size, mixture_components):\n \"\"\"Creates the mixture of Gaussians prior distribution.\n\n Args:\n latent_size: The dimensionality of the latent representation.\n mixture_components: Number of elements of the mixture.\n\n Returns:\n random_prior: A `tfd.Distribution` instance representing the distribution\n over encodings in the absence of any evidence.\n \"\"\"\n if 
mixture_components == 1:\n # See the module docstring for why we don't learn the parameters here.\n return tfd.MultivariateNormalDiag(\n loc=tf.zeros([latent_size]),\n scale_identity_multiplier=1.0)\n\n loc = tf.compat.v1.get_variable(\n name=\"loc\", shape=[mixture_components, latent_size])\n raw_scale_diag = tf.compat.v1.get_variable(\n name=\"raw_scale_diag\", shape=[mixture_components, latent_size])\n mixture_logits = tf.compat.v1.get_variable(\n name=\"mixture_logits\", shape=[mixture_components])\n\n return tfd.MixtureSameFamily(\n components_distribution=tfd.MultivariateNormalDiag(\n loc=loc,\n scale_diag=tf.nn.softplus(raw_scale_diag)),\n mixture_distribution=tfd.Categorical(logits=mixture_logits),\n name=\"prior\")"} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef pack_images(images, rows, cols):\n shape = tf.shape(input=images)\n width = shape[-3]\n height = shape[-2]\n depth = shape[-1]\n images = tf.reshape(images, (-1, width, height, depth))\n batch = tf.shape(input=images)[0]\n rows = tf.minimum(rows, batch)\n cols = tf.minimum(batch // rows, cols)\n images = images[:rows * cols]\n images = tf.reshape(images, (rows, cols, width, height, depth))\n images = tf.transpose(a=images, perm=[0, 2, 1, 3, 4])\n images = tf.reshape(images, [1, rows * width, cols * height, depth])\n return images", "response": "Helper utility to make a field of images."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef model_fn(features, labels, mode, params, config):\n del labels, config\n\n if params[\"analytic_kl\"] and params[\"mixture_components\"] != 1:\n raise NotImplementedError(\n \"Using `analytic_kl` is only supported when `mixture_components = 1` \"\n \"since there's no closed form otherwise.\")\n\n encoder = make_encoder(params[\"activation\"],\n params[\"latent_size\"],\n params[\"base_depth\"])\n decoder = 
make_decoder(params[\"activation\"],\n params[\"latent_size\"],\n IMAGE_SHAPE,\n params[\"base_depth\"])\n latent_prior = make_mixture_prior(params[\"latent_size\"],\n params[\"mixture_components\"])\n\n image_tile_summary(\n \"input\", tf.cast(features, dtype=tf.float32), rows=1, cols=16)\n\n approx_posterior = encoder(features)\n approx_posterior_sample = approx_posterior.sample(params[\"n_samples\"])\n decoder_likelihood = decoder(approx_posterior_sample)\n image_tile_summary(\n \"recon/sample\",\n tf.cast(decoder_likelihood.sample()[:3, :16], dtype=tf.float32),\n rows=3,\n cols=16)\n image_tile_summary(\n \"recon/mean\",\n decoder_likelihood.mean()[:3, :16],\n rows=3,\n cols=16)\n\n # `distortion` is just the negative log likelihood.\n distortion = -decoder_likelihood.log_prob(features)\n avg_distortion = tf.reduce_mean(input_tensor=distortion)\n tf.compat.v1.summary.scalar(\"distortion\", avg_distortion)\n\n if params[\"analytic_kl\"]:\n rate = tfd.kl_divergence(approx_posterior, latent_prior)\n else:\n rate = (approx_posterior.log_prob(approx_posterior_sample)\n - latent_prior.log_prob(approx_posterior_sample))\n avg_rate = tf.reduce_mean(input_tensor=rate)\n tf.compat.v1.summary.scalar(\"rate\", avg_rate)\n\n elbo_local = -(rate + distortion)\n\n elbo = tf.reduce_mean(input_tensor=elbo_local)\n loss = -elbo\n tf.compat.v1.summary.scalar(\"elbo\", elbo)\n\n importance_weighted_elbo = tf.reduce_mean(\n input_tensor=tf.reduce_logsumexp(input_tensor=elbo_local, axis=0) -\n tf.math.log(tf.cast(params[\"n_samples\"], dtype=tf.float32)))\n tf.compat.v1.summary.scalar(\"elbo/importance_weighted\",\n importance_weighted_elbo)\n\n # Decode samples from the prior for visualization.\n random_image = decoder(latent_prior.sample(16))\n image_tile_summary(\n \"random/sample\",\n tf.cast(random_image.sample(), dtype=tf.float32),\n rows=4,\n cols=4)\n image_tile_summary(\"random/mean\", random_image.mean(), rows=4, cols=4)\n\n # Perform variational inference by minimizing 
the -ELBO.\n global_step = tf.compat.v1.train.get_or_create_global_step()\n learning_rate = tf.compat.v1.train.cosine_decay(\n params[\"learning_rate\"], global_step, params[\"max_steps\"])\n tf.compat.v1.summary.scalar(\"learning_rate\", learning_rate)\n optimizer = tf.compat.v1.train.AdamOptimizer(learning_rate)\n train_op = optimizer.minimize(loss, global_step=global_step)\n\n return tf.estimator.EstimatorSpec(\n mode=mode,\n loss=loss,\n train_op=train_op,\n eval_metric_ops={\n \"elbo\":\n tf.compat.v1.metrics.mean(elbo),\n \"elbo/importance_weighted\":\n tf.compat.v1.metrics.mean(importance_weighted_elbo),\n \"rate\":\n tf.compat.v1.metrics.mean(avg_rate),\n \"distortion\":\n tf.compat.v1.metrics.mean(avg_distortion),\n },\n )", "response": "Builds the model function for use in an estimator."} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\ndownloads a file from a directory.", "response": "def download(directory, filename):\n \"\"\"Downloads a file.\"\"\"\n filepath = os.path.join(directory, filename)\n if tf.io.gfile.exists(filepath):\n return filepath\n if not tf.io.gfile.exists(directory):\n tf.io.gfile.makedirs(directory)\n url = os.path.join(ROOT_PATH, filename)\n print(\"Downloading %s to %s\" % (url, filepath))\n urllib.request.urlretrieve(url, filepath)\n return filepath"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef build_fake_input_fns(batch_size):\n random_sample = np.random.rand(batch_size, *IMAGE_SHAPE).astype(\"float32\")\n\n def train_input_fn():\n dataset = tf.data.Dataset.from_tensor_slices(\n random_sample).map(lambda row: (row, 0)).batch(batch_size).repeat()\n return tf.compat.v1.data.make_one_shot_iterator(dataset).get_next()\n\n def eval_input_fn():\n dataset = tf.data.Dataset.from_tensor_slices(\n random_sample).map(lambda row: (row, 0)).batch(batch_size)\n return tf.compat.v1.data.make_one_shot_iterator(dataset).get_next()\n\n 
return train_input_fn, eval_input_fn", "response": "Builds fake MNIST-style data for unit testing."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef build_input_fns(data_dir, batch_size):\n\n # Build an iterator over training batches.\n def train_input_fn():\n dataset = static_mnist_dataset(data_dir, \"train\")\n dataset = dataset.shuffle(50000).repeat().batch(batch_size)\n return tf.compat.v1.data.make_one_shot_iterator(dataset).get_next()\n\n # Build an iterator over the heldout set.\n def eval_input_fn():\n eval_dataset = static_mnist_dataset(data_dir, \"valid\")\n eval_dataset = eval_dataset.batch(batch_size)\n return tf.compat.v1.data.make_one_shot_iterator(eval_dataset).get_next()\n\n return train_input_fn, eval_input_fn", "response": "Builds an iterator switching between train and heldout data."} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\nverifies that parts don't broadcast.", "response": "def maybe_check_wont_broadcast(flat_xs, validate_args):\n \"\"\"Verifies that `parts` don't broadcast.\"\"\"\n flat_xs = tuple(flat_xs) # So we can receive generators.\n if not validate_args:\n # Note: we don't try static validation because it is theoretically\n # possible that a user wants to take advantage of broadcasting.\n # Only when `validate_args` is `True` do we enforce the validation.\n return flat_xs\n msg = 'Broadcasting probably indicates an error in model specification.'\n s = tuple(x.shape for x in flat_xs)\n if all(tensorshape_util.is_fully_defined(s_) for s_ in s):\n if not all(a == b for a, b in zip(s[1:], s[:-1])):\n raise ValueError(msg)\n return flat_xs\n assertions = [assert_util.assert_equal(a, b, message=msg)\n for a, b in zip(s[1:], s[:-1])]\n with tf.control_dependencies(assertions):\n return tuple(tf.identity(x) for x in flat_xs)"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef
softplus_and_shift(x, shift=1e-5, name=None):\n with tf.compat.v1.name_scope(name, 'softplus_and_shift', [x, shift]):\n x = tf.convert_to_tensor(value=x, name='x')\n y = tf.nn.softplus(x)\n if shift is not None:\n y += shift\n return y", "response": "Converts a batch of float-like tensors to positive valued scalars."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\nconverting x to a batch of lower-triangular matrices with a positive diagonal.", "response": "def tril_with_diag_softplus_and_shift(x, diag_shift=1e-5, name=None):\n \"\"\"Converts (batch of) vectors to (batch of) lower-triangular scale matrices.\n\n Args:\n x: (Batch of) `float`-like `Tensor` representing vectors which will be\n transformed into lower-triangular scale matrices with positive diagonal\n elements. Rightmost shape `n` must be such that\n `n = dims * (dims + 1) / 2` for some positive, integer `dims`.\n diag_shift: `Tensor` added to `softplus` transformation of diagonal\n elements.\n Default value: `1e-5`.\n name: A `name_scope` name for operations created by this function.\n Default value: `None` (i.e., \"tril_with_diag_softplus_and_shift\").\n\n Returns:\n scale_tril: (Batch of) lower-triangular `Tensor` with `x.dtype` and\n rightmost shape `[dims, dims]` where `n = dims * (dims + 1) / 2` where\n `n = x.shape[-1]`.\n \"\"\"\n with tf.compat.v1.name_scope(name, 'tril_with_diag_softplus_and_shift',\n [x, diag_shift]):\n x = tf.convert_to_tensor(value=x, name='x')\n x = tfd.fill_triangular(x)\n diag = softplus_and_shift(tf.linalg.diag_part(x), diag_shift)\n x = tf.linalg.set_diag(x, diag)\n return x"} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\nconstruct a trainable multivariate normal triangular distribution.", "response": "def multivariate_normal_tril(x,\n dims,\n layer_fn=tf.compat.v1.layers.dense,\n loc_fn=lambda x: x,\n scale_fn=tril_with_diag_softplus_and_shift,\n name=None):\n \"\"\"Constructs a trainable
`tfd.MultivariateNormalTriL` distribution.\n\n This function creates a MultivariateNormal (MVN) with lower-triangular scale\n matrix. By default the MVN is parameterized via affine transformation of input\n tensor `x`. Using default args, this function is mathematically equivalent to:\n\n ```none\n Y = MVN(loc=matmul(W, x) + b,\n scale_tril=f(reshape_tril(matmul(M, x) + c)))\n\n where,\n W in R^[d, n]\n M in R^[d*(d+1)/2, n]\n b in R^d\n c in R^d\n f(S) = set_diag(S, softplus(matrix_diag_part(S)) + 1e-5)\n ```\n\n Observe that `f` makes the diagonal of the triangular-lower scale matrix be\n positive and no smaller than `1e-5`.\n\n #### Examples\n\n ```python\n # This example fits a multilinear regression loss.\n import tensorflow as tf\n import tensorflow_probability as tfp\n\n # Create fictitious training data.\n dtype = np.float32\n n = 3000 # number of samples\n x_size = 4 # size of single x\n y_size = 2 # size of single y\n def make_training_data():\n np.random.seed(142)\n x = np.random.randn(n, x_size).astype(dtype)\n w = np.random.randn(x_size, y_size).astype(dtype)\n b = np.random.randn(1, y_size).astype(dtype)\n true_mean = np.tensordot(x, w, axes=[[-1], [0]]) + b\n noise = np.random.randn(n, y_size).astype(dtype)\n y = true_mean + noise\n return y, x\n y, x = make_training_data()\n\n # Build TF graph for fitting MVNTriL maximum likelihood estimator.\n mvn = tfp.trainable_distributions.multivariate_normal_tril(x, dims=y_size)\n loss = -tf.reduce_mean(mvn.log_prob(y))\n train_op = tf.train.AdamOptimizer(learning_rate=2.**-3).minimize(loss)\n mse = tf.reduce_mean(tf.squared_difference(y, mvn.mean()))\n init_op = tf.global_variables_initializer()\n\n # Run graph 1000 times.\n num_steps = 1000\n loss_ = np.zeros(num_steps) # Style: `_` to indicate sess.run result.\n mse_ = np.zeros(num_steps)\n with tf.Session() as sess:\n sess.run(init_op)\n for it in xrange(loss_.size):\n _, loss_[it], mse_[it] = sess.run([train_op, loss, mse])\n if it % 200 == 0 or it == 
loss_.size - 1:\n print(\"iteration:{} loss:{} mse:{}\".format(it, loss_[it], mse_[it]))\n\n # ==> iteration:0 loss:38.2020797729 mse:4.17175960541\n # iteration:200 loss:2.90179634094 mse:0.990987896919\n # iteration:400 loss:2.82727336884 mse:0.990926623344\n # iteration:600 loss:2.82726788521 mse:0.990926682949\n # iteration:800 loss:2.82726788521 mse:0.990926682949\n # iteration:999 loss:2.82726788521 mse:0.990926682949\n ```\n\n Args:\n x: `Tensor` with floating type. Must have statically defined rank and\n statically known right-most dimension.\n dims: Scalar, `int`, `Tensor` indicated the MVN event size, i.e., the\n created MVN will be distribution over length-`dims` vectors.\n layer_fn: Python `callable` which takes input `x` and `int` scalar `d` and\n returns a transformation of `x` with shape\n `tf.concat([tf.shape(x)[:-1], [d]], axis=0)`.\n Default value: `tf.layers.dense`.\n loc_fn: Python `callable` which transforms the `loc` parameter. Takes a\n (batch of) length-`dims` vectors and returns a `Tensor` of same shape and\n `dtype`.\n Default value: `lambda x: x`.\n scale_fn: Python `callable` which transforms the `scale` parameters. 
Takes a\n (batch of) length-`dims * (dims + 1) / 2` vectors and returns a\n lower-triangular `Tensor` of same batch shape with rightmost dimensions\n having shape `[dims, dims]`.\n Default value: `tril_with_diag_softplus_and_shift`.\n name: A `name_scope` name for operations created by this function.\n Default value: `None` (i.e., \"multivariate_normal_tril\").\n\n Returns:\n mvntril: An instance of `tfd.MultivariateNormalTriL`.\n \"\"\"\n with tf.compat.v1.name_scope(name, 'multivariate_normal_tril', [x, dims]):\n x = tf.convert_to_tensor(value=x, name='x')\n x = layer_fn(x, dims + dims * (dims + 1) // 2)\n return tfd.MultivariateNormalTriL(\n loc=loc_fn(x[..., :dims]),\n scale_tril=scale_fn(x[..., dims:]))"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef bernoulli(x, layer_fn=tf.compat.v1.layers.dense, name=None):\n with tf.compat.v1.name_scope(name, 'bernoulli', [x]):\n x = tf.convert_to_tensor(value=x, name='x')\n logits = tf.squeeze(layer_fn(x, 1), axis=-1)\n return tfd.Bernoulli(logits=logits)", "response": "Constructs a trainable Bernoulli distribution."} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\nconstructs a trainable tfd. 
Normal distribution.", "response": "def normal(x,\n layer_fn=tf.compat.v1.layers.dense,\n loc_fn=lambda x: x,\n scale_fn=1.,\n name=None):\n \"\"\"Constructs a trainable `tfd.Normal` distribution.\n\n\n This function creates a Normal distribution parameterized by loc and scale.\n Using default args, this function is mathematically equivalent to:\n\n ```none\n Y = Normal(loc=matmul(W, x) + b, scale=1)\n\n where,\n W in R^[d, n]\n b in R^d\n ```\n\n #### Examples\n\n This function can be used as a [linear regression](\n https://en.wikipedia.org/wiki/Linear_regression) loss.\n\n ```python\n # This example fits a linear regression loss.\n import tensorflow as tf\n import tensorflow_probability as tfp\n\n # Create fictitious training data.\n dtype = np.float32\n n = 3000 # number of samples\n x_size = 4 # size of single x\n def make_training_data():\n np.random.seed(142)\n x = np.random.randn(n, x_size).astype(dtype)\n w = np.random.randn(x_size).astype(dtype)\n b = np.random.randn(1).astype(dtype)\n true_mean = np.tensordot(x, w, axes=[[-1], [-1]]) + b\n noise = np.random.randn(n).astype(dtype)\n y = true_mean + noise\n return y, x\n y, x = make_training_data()\n\n # Build TF graph for fitting Normal maximum likelihood estimator.\n normal = tfp.trainable_distributions.normal(x)\n loss = -tf.reduce_mean(normal.log_prob(y))\n train_op = tf.train.AdamOptimizer(learning_rate=2.**-5).minimize(loss)\n mse = tf.reduce_mean(tf.squared_difference(y, normal.mean()))\n init_op = tf.global_variables_initializer()\n\n # Run graph 1000 times.\n num_steps = 1000\n loss_ = np.zeros(num_steps) # Style: `_` to indicate sess.run result.\n mse_ = np.zeros(num_steps)\n with tf.Session() as sess:\n sess.run(init_op)\n for it in xrange(loss_.size):\n _, loss_[it], mse_[it] = sess.run([train_op, loss, mse])\n if it % 200 == 0 or it == loss_.size - 1:\n print(\"iteration:{} loss:{} mse:{}\".format(it, loss_[it], mse_[it]))\n\n # ==> iteration:0 loss:6.34114170074 mse:10.8444051743\n # 
iteration:200 loss:1.40146839619 mse:0.965059816837\n # iteration:400 loss:1.40052902699 mse:0.963181257248\n # iteration:600 loss:1.40052902699 mse:0.963181257248\n # iteration:800 loss:1.40052902699 mse:0.963181257248\n # iteration:999 loss:1.40052902699 mse:0.963181257248\n ```\n\n Args:\n x: `Tensor` with floating type. Must have statically defined rank and\n statically known right-most dimension.\n layer_fn: Python `callable` which takes input `x` and `int` scalar `d` and\n returns a transformation of `x` with shape\n `tf.concat([tf.shape(x)[:-1], [1]], axis=0)`.\n Default value: `tf.layers.dense`.\n loc_fn: Python `callable` which transforms the `loc` parameter. Takes a\n (batch of) length-`dims` vectors and returns a `Tensor` of same shape and\n `dtype`.\n Default value: `lambda x: x`.\n scale_fn: Python `callable` or `Tensor`. If a `callable` transforms the\n `scale` parameters; if `Tensor` is the `tfd.Normal` `scale` argument.\n Takes a (batch of) length-`dims` vectors and returns a `Tensor` of same\n size. 
(Taking a `callable` or `Tensor` is how `tf.Variable` intializers\n behave.)\n Default value: `1`.\n name: A `name_scope` name for operations created by this function.\n Default value: `None` (i.e., \"normal\").\n\n Returns:\n normal: An instance of `tfd.Normal`.\n \"\"\"\n with tf.compat.v1.name_scope(name, 'normal', [x]):\n x = tf.convert_to_tensor(value=x, name='x')\n if callable(scale_fn):\n y = layer_fn(x, 2)\n loc = loc_fn(y[..., 0])\n scale = scale_fn(y[..., 1])\n else:\n y = tf.squeeze(layer_fn(x, 1), axis=-1)\n loc = loc_fn(y)\n scale = tf.cast(scale_fn, loc.dtype.base_dtype)\n return tfd.Normal(loc=loc, scale=scale)"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef poisson(x,\n layer_fn=tf.compat.v1.layers.dense,\n log_rate_fn=lambda x: x,\n name=None):\n \"\"\"Constructs a trainable `tfd.Poisson` distribution.\n\n This function creates a Poisson distribution parameterized by log rate.\n Using default args, this function is mathematically equivalent to:\n\n ```none\n Y = Poisson(log_rate=matmul(W, x) + b)\n\n where,\n W in R^[d, n]\n b in R^d\n ```\n\n #### Examples\n\n This can be used as a [Poisson regression](\n https://en.wikipedia.org/wiki/Poisson_regression) loss.\n\n ```python\n # This example fits a poisson regression loss.\n import numpy as np\n import tensorflow as tf\n import tensorflow_probability as tfp\n\n # Create fictitious training data.\n dtype = np.float32\n n = 3000 # number of samples\n x_size = 4 # size of single x\n def make_training_data():\n np.random.seed(142)\n x = np.random.randn(n, x_size).astype(dtype)\n w = np.random.randn(x_size).astype(dtype)\n b = np.random.randn(1).astype(dtype)\n true_log_rate = np.tensordot(x, w, axes=[[-1], [-1]]) + b\n y = np.random.poisson(lam=np.exp(true_log_rate)).astype(dtype)\n return y, x\n y, x = make_training_data()\n\n # Build TF graph for fitting Poisson maximum likelihood estimator.\n poisson = 
tfp.trainable_distributions.poisson(x)\n loss = -tf.reduce_mean(poisson.log_prob(y))\n train_op = tf.train.AdamOptimizer(learning_rate=2.**-5).minimize(loss)\n mse = tf.reduce_mean(tf.squared_difference(y, poisson.mean()))\n init_op = tf.global_variables_initializer()\n\n # Run graph 1000 times.\n num_steps = 1000\n loss_ = np.zeros(num_steps) # Style: `_` to indicate sess.run result.\n mse_ = np.zeros(num_steps)\n with tf.Session() as sess:\n sess.run(init_op)\n for it in xrange(loss_.size):\n _, loss_[it], mse_[it] = sess.run([train_op, loss, mse])\n if it % 200 == 0 or it == loss_.size - 1:\n print(\"iteration:{} loss:{} mse:{}\".format(it, loss_[it], mse_[it]))\n\n # ==> iteration:0 loss:37.0814208984 mse:6359.41259766\n # iteration:200 loss:1.42010736465 mse:40.7654914856\n # iteration:400 loss:1.39027583599 mse:8.77660560608\n # iteration:600 loss:1.3902695179 mse:8.78443241119\n # iteration:800 loss:1.39026939869 mse:8.78443622589\n # iteration:999 loss:1.39026939869 mse:8.78444766998\n ```\n\n Args:\n x: `Tensor` with floating type. Must have statically defined rank and\n statically known right-most dimension.\n layer_fn: Python `callable` which takes input `x` and `int` scalar `d` and\n returns a transformation of `x` with shape\n `tf.concat([tf.shape(x)[:-1], [1]], axis=0)`.\n Default value: `tf.layers.dense`.\n log_rate_fn: Python `callable` which transforms the `log_rate` parameter.\n Takes a (batch of) length-`dims` vectors and returns a `Tensor` of same\n shape and `dtype`.\n Default value: `lambda x: x`.\n name: A `name_scope` name for operations created by this function.\n Default value: `None` (i.e., \"poisson\").\n\n Returns:\n poisson: An instance of `tfd.Poisson`.\n \"\"\"\n with tf.compat.v1.name_scope(name, 'poisson', [x]):\n x = tf.convert_to_tensor(value=x, name='x')\n log_rate = log_rate_fn(tf.squeeze(layer_fn(x, 1), axis=-1))\n return tfd.Poisson(log_rate=log_rate)", "response": "Constructs a trainable tfd. 
Poisson distribution."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _euler_method(random_draw_parts,\n state_parts,\n drift_parts,\n step_size_parts,\n volatility_parts,\n name=None):\n \"\"\"Applies one step of Euler-Maruyama method.\n\n Generates proposal of the form:\n\n ```python\n tfd.Normal(loc=state_parts + _get_drift(state_parts, ...),\n scale=tf.sqrt(step_size * volatility_fn(current_state)))\n ```\n\n `_get_drift(state_parts, ..)` is a diffusion drift value at `state_parts`.\n\n\n Args:\n random_draw_parts: Python `list` of `Tensor`s containing the value(s) of the\n random perturbation variable(s). Must broadcast with the shape of\n `state_parts`.\n state_parts: Python `list` of `Tensor`s representing the current\n state(s) of the Markov chain(s).\n drift_parts: Python `list` of `Tensor`s representing value of the drift\n `_get_drift(*state_parts, ..)`. Must broadcast with the shape of\n `state_parts`.\n step_size_parts: Python `list` of `Tensor`s representing the step size for\n the Euler-Maruyama method. Must broadcast with the shape of\n `state_parts`. Larger step sizes lead to faster progress, but\n too-large step sizes make rejection exponentially more likely. When\n possible, it's often helpful to match per-variable step sizes to the\n standard deviations of the target distribution in each variable.\n volatility_parts: Python `list` of `Tensor`s representing the value of\n `volatility_fn(*state_parts)`. Must broadcast with the shape of\n `state_parts`.\n name: Python `str` name prefixed to Ops created by this function.\n Default value: `None` (i.e., 'mala_euler_method').\n\n Returns:\n proposed_state_parts: Tensor or Python list of `Tensor`s representing the\n state(s) of the Markov chain(s) at each result step. 
Has same shape as\n input `current_state_parts`.\n \"\"\"\n with tf.compat.v1.name_scope(name, 'mala_euler_method', [\n random_draw_parts, state_parts, drift_parts, step_size_parts,\n volatility_parts\n ]):\n proposed_state_parts = []\n for random_draw, state, drift, step_size, volatility in zip(\n random_draw_parts,\n state_parts,\n drift_parts,\n step_size_parts,\n volatility_parts):\n proposal = state + drift + volatility * tf.sqrt(step_size) * random_draw\n proposed_state_parts.append(proposal)\n\n return proposed_state_parts", "response": "Applies one step of Euler - Maruyama method."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _get_drift(step_size_parts, volatility_parts, grads_volatility,\n grads_target_log_prob,\n name=None):\n \"\"\"Compute diffusion drift at the current location `current_state`.\n\n The drift of the diffusion at is computed as\n\n ```none\n 0.5 * `step_size` * volatility_parts * `target_log_prob_fn(current_state)`\n + `step_size` * `grads_volatility`\n ```\n\n where `volatility_parts` = `volatility_fn(current_state)**2` and\n `grads_volatility` is a gradient of `volatility_parts` at the `current_state`.\n\n Args:\n step_size_parts: Python `list` of `Tensor`s representing the step size for\n Euler-Maruyama method. Must broadcast with the shape of\n `volatility_parts`. Larger step sizes lead to faster progress, but\n too-large step sizes make rejection exponentially more likely. When\n possible, it's often helpful to match per-variable step sizes to the\n standard deviations of the target distribution in each variable.\n volatility_parts: Python `list` of `Tensor`s representing the value of\n `volatility_fn(*state_parts)`.\n grads_volatility: Python list of `Tensor`s representing the value of the\n gradient of `volatility_parts**2` wrt the state of the chain.\n grads_target_log_prob: Python list of `Tensor`s representing\n gradient of `target_log_prob_fn(*state_parts`) wrt `state_parts`. 
Must\n have same shape as `volatility_parts`.\n name: Python `str` name prefixed to Ops created by this function.\n Default value: `None` (i.e., 'mala_get_drift').\n\n Returns:\n drift_parts: Tensor or Python list of `Tensor`s representing the\n state(s) of the Markov chain(s) at each result step. Has same shape as\n input `current_state_parts`.\n \"\"\"\n\n with tf.compat.v1.name_scope(name, 'mala_get_drift', [\n step_size_parts, volatility_parts, grads_volatility, grads_target_log_prob\n ]):\n\n drift_parts = []\n\n for step_size, volatility, grad_volatility, grad_target_log_prob in (\n zip(step_size_parts,\n volatility_parts,\n grads_volatility,\n grads_target_log_prob)):\n volatility_squared = tf.square(volatility)\n drift = 0.5 * step_size * (volatility_squared * grad_target_log_prob\n + grad_volatility)\n drift_parts.append(drift)\n\n return drift_parts", "response": "Compute the diffusion drift at the current location."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _maybe_broadcast_volatility(volatility_parts,\n state_parts):\n \"\"\"Helper to broadcast `volatility_parts` to the shape of `state_parts`.\"\"\"\n return [v + tf.zeros_like(sp, dtype=sp.dtype.base_dtype)\n for v, sp in zip(volatility_parts, state_parts)]", "response": "Helper to broadcast volatility_parts to the shape of state_parts."} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\nbuilding an AR transition matrix for a autoregressive state space model.", "response": "def make_ar_transition_matrix(coefficients):\n \"\"\"Build transition matrix for an autoregressive StateSpaceModel.\n\n When applied to a vector of previous values, this matrix computes\n the expected new value (summing the previous states according to the\n autoregressive coefficients) in the top dimension of the state space,\n and moves all previous values down by one dimension, 'forgetting' the\n final (least recent) value. 
That is, it looks like this:\n\n ```\n ar_matrix = [ coefs[0], coefs[1], ..., coefs[order]\n 1., 0 , ..., 0.\n 0., 1., ..., 0.\n ...\n 0., 0., ..., 1., 0. ]\n ```\n\n Args:\n coefficients: float `Tensor` of shape `concat([batch_shape, [order]])`.\n\n Returns:\n ar_matrix: float `Tensor` with shape `concat([batch_shape,\n [order, order]])`.\n \"\"\"\n\n top_row = tf.expand_dims(coefficients, -2)\n coef_shape = dist_util.prefer_static_shape(coefficients)\n batch_shape, order = coef_shape[:-1], coef_shape[-1]\n remaining_rows = tf.concat([\n tf.eye(order - 1, dtype=coefficients.dtype, batch_shape=batch_shape),\n tf.zeros(tf.concat([batch_shape, (order - 1, 1)], axis=0),\n dtype=coefficients.dtype)\n ], axis=-1)\n ar_matrix = tf.concat([top_row, remaining_rows], axis=-2)\n return ar_matrix"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef diag_jacobian(xs,\n ys=None,\n sample_shape=None,\n fn=None,\n parallel_iterations=10,\n name=None):\n \"\"\"Computes diagonal of the Jacobian matrix of `ys=fn(xs)` wrt `xs`.\n\n If `ys` is a tensor or a list of tensors of the form `(ys_1, .., ys_n)` and\n `xs` is of the form `(xs_1, .., xs_n)`, the function `jacobians_diag`\n computes the diagonal of the Jacobian matrix, i.e., the partial derivatives\n `(dys_1/dxs_1,.., dys_n/dxs_n`). 
For definition details, see\n https://en.wikipedia.org/wiki/Jacobian_matrix_and_determinant\n #### Example\n\n ##### Diagonal Hessian of the log-density of a 3D Gaussian distribution\n\n In this example we sample from a standard univariate normal\n distribution using MALA with `step_size` equal to 0.75.\n\n ```python\n import tensorflow as tf\n import tensorflow_probability as tfp\n import numpy as np\n\n tfd = tfp.distributions\n\n dtype = np.float32\n with tf.Session(graph=tf.Graph()) as sess:\n true_mean = dtype([0, 0, 0])\n true_cov = dtype([[1, 0.25, 0.25], [0.25, 2, 0.25], [0.25, 0.25, 3]])\n chol = tf.linalg.cholesky(true_cov)\n target = tfd.MultivariateNormalTriL(loc=true_mean, scale_tril=chol)\n\n # Assume that the state is passed as a list of tensors `x` and `y`.\n # Then the target function is defined as follows:\n def target_fn(x, y):\n # Stack the input tensors together\n z = tf.concat([x, y], axis=-1) - true_mean\n return target.log_prob(z)\n\n sample_shape = [3, 5]\n state = [tf.ones(sample_shape + [2], dtype=dtype),\n tf.ones(sample_shape + [1], dtype=dtype)]\n fn_val, grads = tfp.math.value_and_gradient(target_fn, state)\n\n # We can either pass the `sample_shape` of the `state` or not, which impacts\n # computational speed of `diag_jacobian`\n _, diag_jacobian_shape_passed = diag_jacobian(\n xs=state, ys=grads, sample_shape=tf.shape(fn_val))\n _, diag_jacobian_shape_none = diag_jacobian(\n xs=state, ys=grads)\n\n diag_jacobian_shape_passed_ = sess.run(diag_jacobian_shape_passed)\n diag_jacobian_shape_none_ = sess.run(diag_jacobian_shape_none)\n\n print('hessian computed through `diag_jacobian`, sample_shape passed: ',\n np.concatenate(diag_jacobian_shape_passed_, -1))\n print('hessian computed through `diag_jacobian`, sample_shape skipped',\n np.concatenate(diag_jacobian_shape_none_, -1))\n\n ```\n\n Args:\n xs: `Tensor` or a python `list` of `Tensors` of real-like dtypes and shapes\n `sample_shape` + `event_shape_i`, where `event_shape_i` can be 
different\n for different tensors.\n ys: `Tensor` or a python `list` of `Tensors` of the same dtype as `xs`. Must\n broadcast with the shape of `xs`. Can be omitted if `fn` is provided.\n sample_shape: A common `sample_shape` of the input tensors of `xs`. If not,\n provided, assumed to be `[1]`, which may result in a slow performance of\n `jacobians_diag`.\n fn: Python callable that takes `xs` as an argument (or `*xs`, if it is a\n list) and returns `ys`. Might be skipped if `ys` is provided and\n `tf.enable_eager_execution()` is disabled.\n parallel_iterations: `int` that specifies the allowed number of coordinates\n of the input tensor `xs`, for which the partial derivatives `dys_i/dxs_i`\n can be computed in parallel.\n name: Python `str` name prefixed to `Ops` created by this function.\n Default value: `None` (i.e., \"diag_jacobian\").\n\n Returns:\n ys: a list, which coincides with the input `ys`, when provided.\n If the input `ys` is None, `fn(*xs)` gets computed and returned as a list.\n jacobians_diag_res: a `Tensor` or a Python list of `Tensor`s of the same\n dtypes and shapes as the input `xs`. 
This is the diagonal of the Jacobian\n of ys wrt xs.\n\n Raises:\n ValueError: if lists `xs` and `ys` have different length or both `ys` and\n `fn` are `None`, or `fn` is None in the eager execution mode.\n \"\"\"\n with tf.compat.v1.name_scope(name, 'jacobians_diag', [xs, ys]):\n if sample_shape is None:\n sample_shape = [1]\n # Output Jacobian diagonal\n jacobians_diag_res = []\n # Convert input `xs` to a list\n xs = list(xs) if _is_list_like(xs) else [xs]\n xs = [tf.convert_to_tensor(value=x) for x in xs]\n if not tf.executing_eagerly():\n if ys is None:\n if fn is None:\n raise ValueError('Both `ys` and `fn` can not be `None`')\n else:\n ys = fn(*xs)\n # Convert ys to a list\n ys = list(ys) if _is_list_like(ys) else [ys]\n if len(xs) != len(ys):\n raise ValueError('`xs` and `ys` should have the same length')\n for y, x in zip(ys, xs):\n # Broadcast `y` to the shape of `x`.\n y_ = y + tf.zeros_like(x)\n # Change `event_shape` to one-dimension\n y_ = tf.reshape(y, tf.concat([sample_shape, [-1]], -1))\n\n # Declare an iterator and tensor array loop variables for the gradients.\n n = tf.size(input=x) / tf.cast(\n tf.reduce_prod(input_tensor=sample_shape), dtype=tf.int32)\n n = tf.cast(n, dtype=tf.int32)\n loop_vars = [\n 0,\n tf.TensorArray(x.dtype, n)\n ]\n\n def loop_body(j):\n \"\"\"Loop function to compute gradients of the each direction.\"\"\"\n # Gradient along direction `j`.\n res = tf.gradients(ys=y_[..., j], xs=x)[0] # pylint: disable=cell-var-from-loop\n if res is None:\n # Return zero, if the gradient is `None`.\n res = tf.zeros(tf.concat([sample_shape, [1]], -1),\n dtype=x.dtype) # pylint: disable=cell-var-from-loop\n else:\n # Reshape `event_shape` to 1D\n res = tf.reshape(res, tf.concat([sample_shape, [-1]], -1))\n # Add artificial dimension for the case of zero shape input tensor\n res = tf.expand_dims(res, 0)\n res = res[..., j]\n return res # pylint: disable=cell-var-from-loop\n\n # Iterate over all elements of the gradient and compute second 
order\n # derivatives.\n _, jacobian_diag_res = tf.while_loop(\n cond=lambda j, _: j < n, # pylint: disable=cell-var-from-loop\n body=lambda j, result: (j + 1, result.write(j, loop_body(j))),\n loop_vars=loop_vars,\n parallel_iterations=parallel_iterations)\n\n shape_x = tf.shape(input=x)\n # Stack gradients together and move flattened `event_shape` to the\n # zero position\n reshaped_jacobian_diag = tf.transpose(a=jacobian_diag_res.stack())\n # Reshape to the original tensor\n reshaped_jacobian_diag = tf.reshape(reshaped_jacobian_diag, shape_x)\n jacobians_diag_res.append(reshaped_jacobian_diag)\n\n else:\n if fn is None:\n raise ValueError('`fn` can not be `None` when eager execution is '\n 'enabled')\n if ys is None:\n ys = fn(*xs)\n\n def fn_slice(i, j):\n \"\"\"Broadcast y[i], flatten event shape of y[i], return y[i][..., j].\"\"\"\n def fn_broadcast(*state):\n res = fn(*state)\n res = list(res) if _is_list_like(res) else [res]\n if len(res) != len(state):\n res *= len(state)\n res = [tf.reshape(r + tf.zeros_like(s),\n tf.concat([sample_shape, [-1]], -1))\n for r, s in zip(res, state)]\n return res\n # Expand dimensions before returning in order to support 0D input `xs`\n return lambda *state: tf.expand_dims(fn_broadcast(*state)[i], 0)[..., j]\n\n def make_loop_body(i, x):\n \"\"\"Loop function to compute gradients of the each direction.\"\"\"\n def _fn(j, result):\n res = value_and_gradient(fn_slice(i, j), xs)[1][i]\n if res is None:\n res = tf.zeros(tf.concat([sample_shape, [1]], -1), dtype=x.dtype)\n else:\n res = tf.reshape(res, tf.concat([sample_shape, [-1]], -1))\n res = res[..., j]\n return j + 1, result.write(j, res)\n return _fn\n\n for i, x in enumerate(xs):\n # Declare an iterator and tensor array loop variables for the gradients.\n n = tf.size(input=x) / tf.cast(\n tf.reduce_prod(input_tensor=sample_shape), dtype=tf.int32)\n n = tf.cast(n, dtype=tf.int32)\n loop_vars = [\n 0,\n tf.TensorArray(x.dtype, n)\n ]\n\n # Iterate over all elements of the 
gradient and compute second order\n      # derivatives.\n      _, jacobian_diag_res = tf.while_loop(\n          cond=lambda j, _: j < n,\n          body=make_loop_body(i, x),\n          loop_vars=loop_vars,\n          parallel_iterations=parallel_iterations)\n\n      shape_x = tf.shape(input=x)\n      # Stack gradients together and move flattened `event_shape` to the\n      # zero position\n      reshaped_jacobian_diag = tf.transpose(a=jacobian_diag_res.stack())\n      # Reshape to the original tensor\n      reshaped_jacobian_diag = tf.reshape(reshaped_jacobian_diag, shape_x)\n      jacobians_diag_res.append(reshaped_jacobian_diag)\n\n    return ys, jacobians_diag_res", "response": "Computes the diagonal of the Jacobian matrix of ys = fn(xs) wrt xs."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef calculate_reshape(original_shape, new_shape, validate=False, name=None):\n  batch_shape_static = tensorshape_util.constant_value_as_shape(new_shape)\n  if tensorshape_util.is_fully_defined(batch_shape_static):\n    return np.int32(batch_shape_static), batch_shape_static, []\n  with tf.name_scope(name or \"calculate_reshape\"):\n    original_size = tf.reduce_prod(input_tensor=original_shape)\n    implicit_dim = tf.equal(new_shape, -1)\n    size_implicit_dim = (\n        original_size // tf.maximum(1, -tf.reduce_prod(input_tensor=new_shape)))\n    new_ndims = tf.shape(input=new_shape)\n    expanded_new_shape = tf.where(  # Assumes exactly one `-1`.\n        implicit_dim, tf.fill(new_ndims, size_implicit_dim), new_shape)\n    validations = [] if not validate else [  # pylint: disable=g-long-ternary\n        assert_util.assert_rank(\n            original_shape, 1, message=\"Original shape must be a vector.\"),\n        assert_util.assert_rank(\n            new_shape, 1, message=\"New shape must be a vector.\"),\n        assert_util.assert_less_equal(\n            tf.math.count_nonzero(implicit_dim, dtype=tf.int32),\n            1,\n            message=\"At most one dimension can be unknown.\"),\n        assert_util.assert_positive(\n            expanded_new_shape, message=\"Shape elements must be >=-1.\"),\n        
assert_util.assert_equal(\n tf.reduce_prod(input_tensor=expanded_new_shape),\n original_size,\n message=\"Shape sizes do not match.\"),\n ]\n return expanded_new_shape, batch_shape_static, validations", "response": "Calculates the reshaped dimensions."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef validate_init_args_statically(distribution, batch_shape):\n if tensorshape_util.rank(batch_shape.shape) is not None:\n if tensorshape_util.rank(batch_shape.shape) != 1:\n raise ValueError(\"`batch_shape` must be a vector \"\n \"(saw rank: {}).\".format(\n tensorshape_util.rank(batch_shape.shape)))\n\n batch_shape_static = tensorshape_util.constant_value_as_shape(batch_shape)\n batch_size_static = tensorshape_util.num_elements(batch_shape_static)\n dist_batch_size_static = tensorshape_util.num_elements(\n distribution.batch_shape)\n\n if batch_size_static is not None and dist_batch_size_static is not None:\n if batch_size_static != dist_batch_size_static:\n raise ValueError(\"`batch_shape` size ({}) must match \"\n \"`distribution.batch_shape` size ({}).\".format(\n batch_size_static, dist_batch_size_static))\n\n if tensorshape_util.dims(batch_shape_static) is not None:\n if any(\n tf.compat.dimension_value(dim) is not None and\n tf.compat.dimension_value(dim) < 1 for dim in batch_shape_static):\n raise ValueError(\"`batch_shape` elements must be >=-1.\")", "response": "Helper to __init__ which makes or raises assertions."} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef _sample_shape(self, x):\n x_ndims = (\n tf.rank(x) if tensorshape_util.rank(x.shape) is None else\n tensorshape_util.rank(x.shape))\n event_ndims = (\n tf.size(input=self.event_shape_tensor())\n if tensorshape_util.rank(self.event_shape) is None else\n tensorshape_util.rank(self.event_shape))\n batch_ndims = (\n tf.size(input=self._batch_shape_unexpanded)\n if 
tensorshape_util.rank(self.batch_shape) is None else\n tensorshape_util.rank(self.batch_shape))\n sample_ndims = x_ndims - batch_ndims - event_ndims\n if isinstance(sample_ndims, int):\n static_sample_shape = x.shape[:sample_ndims]\n else:\n static_sample_shape = tf.TensorShape(None)\n if tensorshape_util.is_fully_defined(static_sample_shape):\n sample_shape = np.int32(static_sample_shape)\n else:\n sample_shape = tf.shape(input=x)[:sample_ndims]\n return sample_shape, static_sample_shape", "response": "Computes graph and static sample_shape."} {"SOURCE": "codesearchnet", "instruction": "Can you create a Python 3 function that\ncalls fn appropriately reshaping its input x and output.", "response": "def _call_reshape_input_output(self, fn, x, extra_kwargs=None):\n \"\"\"Calls `fn`, appropriately reshaping its input `x` and output.\"\"\"\n # Note: we take `extra_kwargs` as a dict rather than `**extra_kwargs`\n # because it is possible the user provided extra kwargs would itself\n # have `fn` and/or `x` as a key.\n with tf.control_dependencies(self._runtime_assertions +\n self._validate_sample_arg(x)):\n sample_shape, static_sample_shape = self._sample_shape(x)\n old_shape = tf.concat(\n [\n sample_shape,\n self.distribution.batch_shape_tensor(),\n self.event_shape_tensor(),\n ],\n axis=0)\n x_reshape = tf.reshape(x, old_shape)\n result = fn(x_reshape, **extra_kwargs) if extra_kwargs else fn(x_reshape)\n new_shape = tf.concat(\n [\n sample_shape,\n self._batch_shape_unexpanded,\n ], axis=0)\n result = tf.reshape(result, new_shape)\n if (tensorshape_util.rank(static_sample_shape) is not None and\n tensorshape_util.rank(self.batch_shape) is not None):\n new_shape = tensorshape_util.concatenate(static_sample_shape,\n self.batch_shape)\n tensorshape_util.set_shape(result, new_shape)\n return result"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef _call_and_reshape_output(\n self,\n fn,\n event_shape_list=None,\n 
static_event_shape_list=None,\n extra_kwargs=None):\n \"\"\"Calls `fn` and appropriately reshapes its output.\"\"\"\n # Note: we take `extra_kwargs` as a dict rather than `**extra_kwargs`\n # because it is possible the user provided extra kwargs would itself\n # have `fn`, `event_shape_list`, `static_event_shape_list` and/or\n # `extra_kwargs` as keys.\n with tf.control_dependencies(self._runtime_assertions):\n if event_shape_list is None:\n event_shape_list = [self._event_shape_tensor()]\n if static_event_shape_list is None:\n static_event_shape_list = [self.event_shape]\n new_shape = tf.concat(\n [self._batch_shape_unexpanded] + event_shape_list, axis=0)\n result = tf.reshape(fn(**extra_kwargs) if extra_kwargs else fn(),\n new_shape)\n if (tensorshape_util.rank(self.batch_shape) is not None and\n tensorshape_util.rank(self.event_shape) is not None):\n event_shape = tf.TensorShape([])\n for rss in static_event_shape_list:\n event_shape = tensorshape_util.concatenate(event_shape, rss)\n static_shape = tensorshape_util.concatenate(\n self.batch_shape, event_shape)\n tensorshape_util.set_shape(result, static_shape)\n return result", "response": "Calls fn and appropriately reshapes its output."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _validate_sample_arg(self, x):\n with tf.name_scope(\"validate_sample_arg\"):\n x_ndims = (\n tf.rank(x) if tensorshape_util.rank(x.shape) is None else\n tensorshape_util.rank(x.shape))\n event_ndims = (\n tf.size(input=self.event_shape_tensor())\n if tensorshape_util.rank(self.event_shape) is None else\n tensorshape_util.rank(self.event_shape))\n batch_ndims = (\n tf.size(input=self._batch_shape_unexpanded)\n if tensorshape_util.rank(self.batch_shape) is None else\n tensorshape_util.rank(self.batch_shape))\n expected_batch_event_ndims = batch_ndims + event_ndims\n\n if (isinstance(x_ndims, int) and\n isinstance(expected_batch_event_ndims, int)):\n if x_ndims < 
expected_batch_event_ndims:\n raise NotImplementedError(\n \"Broadcasting is not supported; too few batch and event dims \"\n \"(expected at least {}, saw {}).\".format(\n expected_batch_event_ndims, x_ndims))\n ndims_assertion = []\n elif self.validate_args:\n ndims_assertion = [\n assert_util.assert_greater_equal(\n x_ndims,\n expected_batch_event_ndims,\n message=(\"Broadcasting is not supported; too few \"\n \"batch and event dims.\"),\n name=\"assert_batch_and_event_ndims_large_enough\"),\n ]\n\n if (tensorshape_util.is_fully_defined(self.batch_shape) and\n tensorshape_util.is_fully_defined(self.event_shape)):\n expected_batch_event_shape = np.int32(\n tensorshape_util.concatenate(self.batch_shape, self.event_shape))\n else:\n expected_batch_event_shape = tf.concat(\n [\n self.batch_shape_tensor(),\n self.event_shape_tensor(),\n ], axis=0)\n\n sample_ndims = x_ndims - expected_batch_event_ndims\n if isinstance(sample_ndims, int):\n sample_ndims = max(sample_ndims, 0)\n if (isinstance(sample_ndims, int) and\n tensorshape_util.is_fully_defined(x.shape[sample_ndims:])):\n actual_batch_event_shape = np.int32(x.shape[sample_ndims:])\n else:\n sample_ndims = tf.maximum(sample_ndims, 0)\n actual_batch_event_shape = tf.shape(input=x)[sample_ndims:]\n\n if (isinstance(expected_batch_event_shape, np.ndarray) and\n isinstance(actual_batch_event_shape, np.ndarray)):\n if any(expected_batch_event_shape != actual_batch_event_shape):\n raise NotImplementedError(\"Broadcasting is not supported; \"\n \"unexpected batch and event shape \"\n \"(expected {}, saw {}).\".format(\n expected_batch_event_shape,\n actual_batch_event_shape))\n # We need to set the final runtime-assertions to `ndims_assertion` since\n # its possible this assertion was created. 
We could add a condition to\n # only do so if `self.validate_args == True`, however this is redundant\n # as `ndims_assertion` already encodes this information.\n runtime_assertions = ndims_assertion\n elif self.validate_args:\n # We need to make the `ndims_assertion` a control dep because otherwise\n # TF itself might raise an exception owing to this assertion being\n # ill-defined, ie, one cannot even compare different rank Tensors.\n with tf.control_dependencies(ndims_assertion):\n shape_assertion = assert_util.assert_equal(\n expected_batch_event_shape,\n actual_batch_event_shape,\n message=(\"Broadcasting is not supported; \"\n \"unexpected batch and event shape.\"),\n name=\"assert_batch_and_event_shape_same\")\n runtime_assertions = [shape_assertion]\n else:\n runtime_assertions = []\n\n return runtime_assertions", "response": "Helper which validates sample arg."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _maybe_assert_valid_sample(self, counts):\n if not self.validate_args:\n return counts\n counts = distribution_util.embed_check_nonnegative_integer_form(counts)\n return distribution_util.with_dependencies([\n assert_util.assert_less_equal(\n counts,\n self.total_count,\n message=\"counts are not less than or equal to n.\"),\n ], counts)", "response": "Check counts for proper shape values then return tensor version."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef _flat_sample_distributions(self, sample_shape=(), seed=None, value=None):\n ds = []\n values_out = []\n seed = seed_stream.SeedStream('JointDistributionCoroutine', seed)\n gen = self._model()\n index = 0\n d = next(gen)\n try:\n while True:\n actual_distribution = d.distribution if isinstance(d, self.Root) else d\n ds.append(actual_distribution)\n if (value is not None and len(value) > index and\n value[index] is not None):\n seed()\n next_value = value[index]\n 
else:\n next_value = actual_distribution.sample(\n sample_shape=sample_shape if isinstance(d, self.Root) else (),\n seed=seed())\n values_out.append(next_value)\n index += 1\n d = gen.send(next_value)\n except StopIteration:\n pass\n return ds, values_out", "response": "Executes model and creates both samples and distributions."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef _kl_pareto_pareto(a, b, name=None):\n with tf.name_scope(name or \"kl_pareto_pareto\"):\n # Consistent with\n # http://www.mast.queensu.ca/~communications/Papers/gil-msc11.pdf, page 55\n # Terminology is different from source to source for Pareto distributions.\n # The 'concentration' parameter corresponds to 'a' in that source, and the\n # 'scale' parameter corresponds to 'm'.\n final_batch_shape = distribution_util.get_broadcast_shape(\n a.concentration, b.concentration, a.scale, b.scale)\n common_type = dtype_util.common_dtype(\n [a.concentration, b.concentration, a.scale, b.scale], tf.float32)\n return tf.where(\n a.scale >= b.scale,\n b.concentration * (tf.math.log(a.scale) - tf.math.log(b.scale)) +\n tf.math.log(a.concentration) - tf.math.log(b.concentration) +\n b.concentration / a.concentration - 1.0,\n tf.broadcast_to(tf.cast(np.inf, common_type), final_batch_shape))", "response": "Calculate the batched KL divergence of a Pareto distribution."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\nextending the support of a single function at x.", "response": "def _extend_support(self, x, f, alt):\n \"\"\"Returns `f(x)` if x is in the support, and `alt` otherwise.\n\n Given `f` which is defined on the support of this distribution\n (e.g. x > scale), extend the function definition to the real line\n by defining `f(x) = alt` for `x < scale`.\n\n Args:\n x: Floating-point Tensor to evaluate `f` at.\n f: Lambda that takes in a tensor and returns a tensor. 
This represents\n the function whose domain of definition we want to extend.\n alt: Python or numpy literal representing the value to use for extending\n the domain.\n Returns:\n Tensor representing an extension of `f(x)`.\n \"\"\"\n # We need to do a series of broadcasts for the tf.where.\n scale = self.scale + tf.zeros_like(self.concentration)\n is_invalid = x < scale\n scale = scale + tf.zeros_like(x)\n x = x + tf.zeros_like(scale)\n # We need to do this to ensure gradients are sound.\n y = f(tf.where(is_invalid, scale, x))\n if alt == 0.:\n alt = tf.zeros_like(y)\n elif alt == 1.:\n alt = tf.ones_like(y)\n else:\n alt = tf.fill(\n dims=tf.shape(input=y),\n value=dtype_util.as_numpy_dtype(self.dtype)(alt))\n return tf.where(is_invalid, alt, y)"} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\ncreate the variational distribution for LDA.", "response": "def make_lda_variational(activation, num_topics, layer_sizes):\n \"\"\"Creates the variational distribution for LDA.\n\n Args:\n activation: Activation function to use.\n num_topics: The number of topics.\n layer_sizes: The number of hidden units per layer in the encoder.\n\n Returns:\n lda_variational: A function that takes a bag-of-words Tensor as\n input and returns a distribution over topics.\n \"\"\"\n encoder_net = tf.keras.Sequential()\n for num_hidden_units in layer_sizes:\n encoder_net.add(\n tf.keras.layers.Dense(\n num_hidden_units,\n activation=activation,\n kernel_initializer=tf.compat.v1.glorot_normal_initializer()))\n encoder_net.add(\n tf.keras.layers.Dense(\n num_topics,\n activation=tf.nn.softplus,\n kernel_initializer=tf.compat.v1.glorot_normal_initializer()))\n\n def lda_variational(bag_of_words):\n concentration = _clip_dirichlet_parameters(encoder_net(bag_of_words))\n return ed.Dirichlet(concentration=concentration, name=\"topics_posterior\")\n\n return lda_variational"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 
code does\ndef model_fn(features, labels, mode, params, config):\n del labels, config\n\n # Set up the model's learnable parameters.\n logit_concentration = tf.compat.v1.get_variable(\n \"logit_concentration\",\n shape=[1, params[\"num_topics\"]],\n initializer=tf.compat.v1.initializers.constant(\n _softplus_inverse(params[\"prior_initial_value\"])))\n concentration = _clip_dirichlet_parameters(\n tf.nn.softplus(logit_concentration))\n\n num_words = features.shape[1]\n topics_words_logits = tf.compat.v1.get_variable(\n \"topics_words_logits\",\n shape=[params[\"num_topics\"], num_words],\n initializer=tf.compat.v1.glorot_normal_initializer())\n topics_words = tf.nn.softmax(topics_words_logits, axis=-1)\n\n # Compute expected log-likelihood. First, sample from the variational\n # distribution; second, compute the log-likelihood given the sample.\n lda_variational = make_lda_variational(\n params[\"activation\"],\n params[\"num_topics\"],\n params[\"layer_sizes\"])\n with ed.tape() as variational_tape:\n _ = lda_variational(features)\n\n with ed.tape() as model_tape:\n with ed.interception(\n make_value_setter(topics=variational_tape[\"topics_posterior\"])):\n posterior_predictive = latent_dirichlet_allocation(concentration,\n topics_words)\n\n log_likelihood = posterior_predictive.distribution.log_prob(features)\n tf.compat.v1.summary.scalar(\"log_likelihood\",\n tf.reduce_mean(input_tensor=log_likelihood))\n\n # Compute the KL-divergence between two Dirichlets analytically.\n # The sampled KL does not work well for \"sparse\" distributions\n # (see Appendix D of [2]).\n kl = variational_tape[\"topics_posterior\"].distribution.kl_divergence(\n model_tape[\"topics\"].distribution)\n tf.compat.v1.summary.scalar(\"kl\", tf.reduce_mean(input_tensor=kl))\n\n # Ensure that the KL is non-negative (up to a very small slack).\n # Negative KL can happen due to numerical instability.\n with tf.control_dependencies(\n [tf.compat.v1.assert_greater(kl, -1e-3, message=\"kl\")]):\n 
kl = tf.identity(kl)\n\n elbo = log_likelihood - kl\n avg_elbo = tf.reduce_mean(input_tensor=elbo)\n tf.compat.v1.summary.scalar(\"elbo\", avg_elbo)\n loss = -avg_elbo\n\n # Perform variational inference by minimizing the -ELBO.\n global_step = tf.compat.v1.train.get_or_create_global_step()\n optimizer = tf.compat.v1.train.AdamOptimizer(params[\"learning_rate\"])\n\n # This implements the \"burn-in\" for prior parameters (see Appendix D of [2]).\n # For the first prior_burn_in_steps steps they are fixed, and then trained\n # jointly with the other parameters.\n grads_and_vars = optimizer.compute_gradients(loss)\n grads_and_vars_except_prior = [\n x for x in grads_and_vars if x[1] != logit_concentration]\n\n def train_op_except_prior():\n return optimizer.apply_gradients(\n grads_and_vars_except_prior,\n global_step=global_step)\n\n def train_op_all():\n return optimizer.apply_gradients(\n grads_and_vars,\n global_step=global_step)\n\n train_op = tf.cond(\n pred=global_step < params[\"prior_burn_in_steps\"],\n true_fn=train_op_except_prior,\n false_fn=train_op_all)\n\n # The perplexity is an exponent of the average negative ELBO per word.\n words_per_document = tf.reduce_sum(input_tensor=features, axis=1)\n log_perplexity = -elbo / words_per_document\n tf.compat.v1.summary.scalar(\n \"perplexity\", tf.exp(tf.reduce_mean(input_tensor=log_perplexity)))\n (log_perplexity_tensor,\n log_perplexity_update) = tf.compat.v1.metrics.mean(log_perplexity)\n perplexity_tensor = tf.exp(log_perplexity_tensor)\n\n # Obtain the topics summary. 
Implemented as a py_func for simplicity.\n topics = tf.compat.v1.py_func(\n functools.partial(get_topics_strings, vocabulary=params[\"vocabulary\"]),\n [topics_words, concentration],\n tf.string,\n stateful=False)\n tf.compat.v1.summary.text(\"topics\", topics)\n\n return tf.estimator.EstimatorSpec(\n mode=mode,\n loss=loss,\n train_op=train_op,\n eval_metric_ops={\n \"elbo\": tf.compat.v1.metrics.mean(elbo),\n \"log_likelihood\": tf.compat.v1.metrics.mean(log_likelihood),\n \"kl\": tf.compat.v1.metrics.mean(kl),\n \"perplexity\": (perplexity_tensor, log_perplexity_update),\n \"topics\": (topics, tf.no_op()),\n },\n )", "response": "Builds the model function for use in an Estimator."} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\nreturn the summary of the learned topics.", "response": "def get_topics_strings(topics_words, alpha, vocabulary,\n topics_to_print=10, words_per_topic=10):\n \"\"\"Returns the summary of the learned topics.\n\n Arguments:\n topics_words: KxV tensor with topics as rows and words as columns.\n alpha: 1xK tensor of prior Dirichlet concentrations for the\n topics.\n vocabulary: A mapping of word's integer index to the corresponding string.\n topics_to_print: The number of topics with highest prior weight to\n summarize.\n words_per_topic: Number of words per topic to return.\n\n Returns:\n summary: A np.array with strings.\n \"\"\"\n alpha = np.squeeze(alpha, axis=0)\n # Use a stable sorting algorithm so that when alpha is fixed\n # we always get the same topics.\n highest_weight_topics = np.argsort(-alpha, kind=\"mergesort\")\n top_words = np.argsort(-topics_words, axis=1)\n\n res = []\n for topic_idx in highest_weight_topics[:topics_to_print]:\n l = [\"index={} alpha={:.2f}\".format(topic_idx, alpha[topic_idx])]\n l += [vocabulary[word] for word in top_words[topic_idx, :words_per_topic]]\n res.append(\" \".join(l))\n\n return np.array(res)"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the 
following Python 3 code\ndef build_fake_input_fns(batch_size):\n num_words = 1000\n vocabulary = [str(i) for i in range(num_words)]\n\n random_sample = np.random.randint(\n 10, size=(batch_size, num_words)).astype(np.float32)\n\n def train_input_fn():\n dataset = tf.data.Dataset.from_tensor_slices(random_sample)\n dataset = dataset.batch(batch_size).repeat()\n return tf.compat.v1.data.make_one_shot_iterator(dataset).get_next()\n\n def eval_input_fn():\n dataset = tf.data.Dataset.from_tensor_slices(random_sample)\n dataset = dataset.batch(batch_size)\n return tf.compat.v1.data.make_one_shot_iterator(dataset).get_next()\n\n return train_input_fn, eval_input_fn, vocabulary", "response": "Builds fake data for unit testing."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef build_input_fns(data_dir, batch_size):\n\n with open(download(data_dir, \"vocab.pkl\"), \"r\") as f:\n words_to_idx = pickle.load(f)\n num_words = len(words_to_idx)\n\n vocabulary = [None] * num_words\n for word, idx in words_to_idx.items():\n vocabulary[idx] = word\n\n # Build an iterator over training batches.\n def train_input_fn():\n dataset = newsgroups_dataset(\n data_dir, \"train\", num_words, shuffle_and_repeat=True)\n # Prefetching makes training about 1.5x faster.\n dataset = dataset.batch(batch_size).prefetch(32)\n return tf.compat.v1.data.make_one_shot_iterator(dataset).get_next()\n\n # Build an iterator over the heldout set.\n def eval_input_fn():\n dataset = newsgroups_dataset(\n data_dir, \"test\", num_words, shuffle_and_repeat=False)\n dataset = dataset.batch(batch_size)\n return tf.compat.v1.data.make_one_shot_iterator(dataset).get_next()\n\n return train_input_fn, eval_input_fn, vocabulary", "response": "Builds iterators for train and evaluation data."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\ncalculating the batched KL divergence KL with a and b Chi.", "response": "def _kl_chi_chi(a, b, 
name=None):\n \"\"\"Calculate the batched KL divergence KL(a || b) with a and b Chi.\n\n Args:\n a: instance of a Chi distribution object.\n b: instance of a Chi distribution object.\n name: (optional) Name to use for created operations.\n default is \"kl_chi_chi\".\n\n Returns:\n Batchwise KL(a || b)\n \"\"\"\n with tf.name_scope(name or \"kl_chi_chi\"):\n # Consistent with\n # https://mast.queensu.ca/~communications/Papers/gil-msc11.pdf, page 118\n # The paper introduces an additional scaling parameter; setting that\n # parameter to 1 and simplifying yields the expression we use here.\n return (0.5 * tf.math.digamma(0.5 * a.df) * (a.df - b.df) +\n tf.math.lgamma(0.5 * b.df) - tf.math.lgamma(0.5 * a.df))"} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\nreturn a dense column of a Tensor or SparseTensor.", "response": "def _sparse_or_dense_matmul_onehot(sparse_or_dense_matrix, col_index):\n \"\"\"Returns a (dense) column of a Tensor or SparseTensor.\n\n Args:\n sparse_or_dense_matrix: matrix-shaped, `float` `Tensor` or `SparseTensor`.\n col_index: scalar, `int` `Tensor` representing the index of the desired\n column.\n\n Returns:\n column: vector-shaped, `float` `Tensor` with the same dtype as\n `sparse_or_dense_matrix`, representing the `col_index`th column of\n `sparse_or_dense_matrix`.\n \"\"\"\n if isinstance(sparse_or_dense_matrix,\n (tf.SparseTensor, tf.compat.v1.SparseTensorValue)):\n # TODO(b/111924846): Implement better (ideally in a way that allows us to\n # eliminate the `num_rows` arg, if possible).\n num_rows = _get_shape(sparse_or_dense_matrix)[-2]\n batch_shape = _get_shape(sparse_or_dense_matrix)[:-2]\n slice_start = tf.concat([tf.zeros_like(batch_shape), [0, col_index]],\n axis=0)\n slice_size = tf.concat([batch_shape, [num_rows, 1]], axis=0)\n # We momentarily lose static shape information in tf.sparse_slice. 
However\n # we regain it in the following tf.reshape.\n sparse_slice = tf.sparse.slice(sparse_or_dense_matrix,\n tf.cast(slice_start, tf.int64),\n tf.cast(slice_size, tf.int64))\n\n output_shape = tf.concat([batch_shape, [num_rows]], axis=0)\n return tf.reshape(tf.sparse.to_dense(sparse_slice), output_shape)\n else:\n return tf.gather(sparse_or_dense_matrix, col_index, axis=-1)"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef minimize(grad_and_hessian_loss_fn,\n x_start,\n tolerance,\n l1_regularizer,\n l2_regularizer=None,\n maximum_iterations=1,\n maximum_full_sweeps_per_iteration=1,\n learning_rate=None,\n name=None):\n \"\"\"Minimize using Hessian-informed proximal gradient descent.\n\n This function solves the regularized minimization problem\n\n ```none\n argmin{ Loss(x)\n + l1_regularizer * ||x||_1\n + l2_regularizer * ||x||_2**2\n : x in R^n }\n ```\n\n where `Loss` is a convex C^2 function (typically, `Loss` is the negative log\n likelihood of a model and `x` is a vector of model coefficients). The `Loss`\n function does not need to be supplied directly, but this optimizer does need a\n way to compute the gradient and Hessian of the Loss function at a given value\n of `x`. 
The gradient and Hessian are often computationally expensive, and\n this optimizer calls them relatively few times compared with other algorithms.\n\n Args:\n grad_and_hessian_loss_fn: callable that takes as input a (batch of) `Tensor`\n of the same shape and dtype as `x_start` and returns the triple\n `(gradient_unregularized_loss, hessian_unregularized_loss_outer,\n hessian_unregularized_loss_middle)` as defined in the argument spec of\n `minimize_one_step`.\n x_start: (Batch of) vector-shaped, `float` `Tensor` representing the initial\n value of the argument to the `Loss` function.\n tolerance: scalar, `float` `Tensor` representing the tolerance for each\n optimization step; see the `tolerance` argument of\n `minimize_one_step`.\n l1_regularizer: scalar, `float` `Tensor` representing the weight of the L1\n regularization term (see equation above).\n l2_regularizer: scalar, `float` `Tensor` representing the weight of the L2\n regularization term (see equation above).\n Default value: `None` (i.e., no L2 regularization).\n maximum_iterations: Python integer specifying the maximum number of\n iterations of the outer loop of the optimizer. After this many iterations\n of the outer loop, the algorithm will terminate even if the return value\n `optimal_x` has not converged.\n Default value: `1`.\n maximum_full_sweeps_per_iteration: Python integer specifying the maximum\n number of sweeps allowed in each iteration of the outer loop of the\n optimizer. 
Passed as the `maximum_full_sweeps` argument to\n `minimize_one_step`.\n Default value: `1`.\n learning_rate: scalar, `float` `Tensor` representing a multiplicative factor\n used to dampen the proximal gradient descent steps.\n Default value: `None` (i.e., factor is conceptually `1`).\n name: Python string representing the name of the TensorFlow operation.\n The default name is `\"minimize\"`.\n\n Returns:\n x: `Tensor` of the same shape and dtype as `x_start`, representing the\n (batches of) computed values of `x` which minimizes `Loss(x)`.\n is_converged: scalar, `bool` `Tensor` indicating whether the minimization\n procedure converged within the specified number of iterations across all\n batches. Here convergence means that an iteration of the inner loop\n (`minimize_one_step`) returns `True` for its `is_converged` output value.\n iter: scalar, `int` `Tensor` indicating the actual number of iterations of\n the outer loop of the optimizer completed (i.e., number of calls to\n `minimize_one_step` before achieving convergence).\n\n #### References\n\n [1]: Jerome Friedman, Trevor Hastie and Rob Tibshirani. Regularization Paths\n for Generalized Linear Models via Coordinate Descent. _Journal of\n Statistical Software_, 33(1), 2010.\n https://www.jstatsoft.org/article/view/v033i01/v33i01.pdf\n\n [2]: Guo-Xun Yuan, Chia-Hua Ho and Chih-Jen Lin. An Improved GLMNET for\n L1-regularized Logistic Regression. 
_Journal of Machine Learning\n Research_, 13, 2012.\n http://www.jmlr.org/papers/volume13/yuan12a/yuan12a.pdf\n \"\"\"\n graph_deps = [\n x_start,\n l1_regularizer,\n l2_regularizer,\n maximum_iterations,\n maximum_full_sweeps_per_iteration,\n tolerance,\n learning_rate,\n ],\n with tf.compat.v1.name_scope(name, 'minimize', graph_deps):\n\n def _loop_cond(x_start, converged, iter_):\n del x_start\n return tf.logical_and(iter_ < maximum_iterations,\n tf.logical_not(converged))\n\n def _loop_body(x_start, converged, iter_): # pylint: disable=missing-docstring\n g, h_outer, h_middle = grad_and_hessian_loss_fn(x_start)\n x_start, converged, _ = minimize_one_step(\n gradient_unregularized_loss=g,\n hessian_unregularized_loss_outer=h_outer,\n hessian_unregularized_loss_middle=h_middle,\n x_start=x_start,\n l1_regularizer=l1_regularizer,\n l2_regularizer=l2_regularizer,\n maximum_full_sweeps=maximum_full_sweeps_per_iteration,\n tolerance=tolerance,\n learning_rate=learning_rate)\n return x_start, converged, iter_ + 1\n\n return tf.while_loop(\n cond=_loop_cond,\n body=_loop_body,\n loop_vars=[\n x_start,\n tf.zeros([], np.bool, name='converged'),\n tf.zeros([], np.int32, name='iter'),\n ])", "response": "Minimize using regularized minimization problem."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\ncreate the encoder function.", "response": "def make_encoder(base_depth, activation, latent_size, code_size):\n \"\"\"Creates the encoder function.\n\n Args:\n base_depth: Layer base depth in encoder net.\n activation: Activation function in hidden layers.\n latent_size: The number of latent variables in the code.\n code_size: The dimensionality of each latent variable.\n\n Returns:\n encoder: A `callable` mapping a `Tensor` of images to a `Tensor` of shape\n `[..., latent_size, code_size]`.\n \"\"\"\n conv = functools.partial(\n tf.keras.layers.Conv2D, padding=\"SAME\", activation=activation)\n\n encoder_net = tf.keras.Sequential([\n 
conv(base_depth, 5, 1),\n conv(base_depth, 5, 2),\n conv(2 * base_depth, 5, 1),\n conv(2 * base_depth, 5, 2),\n conv(4 * latent_size, 7, padding=\"VALID\"),\n tf.keras.layers.Flatten(),\n tf.keras.layers.Dense(latent_size * code_size, activation=None),\n tf.keras.layers.Reshape([latent_size, code_size])\n ])\n\n def encoder(images):\n \"\"\"Encodes a batch of images.\n\n Args:\n images: A `Tensor` representing the inputs to be encoded, of shape `[...,\n channels]`.\n\n Returns:\n codes: A `float`-like `Tensor` of shape `[..., latent_size, code_size]`.\n It represents latent vectors to be matched with the codebook.\n \"\"\"\n images = 2 * tf.cast(images, dtype=tf.float32) - 1\n codes = encoder_net(images)\n return codes\n\n return encoder"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef make_decoder(base_depth, activation, input_size, output_shape):\n deconv = functools.partial(\n tf.keras.layers.Conv2DTranspose, padding=\"SAME\", activation=activation)\n conv = functools.partial(\n tf.keras.layers.Conv2D, padding=\"SAME\", activation=activation)\n decoder_net = tf.keras.Sequential([\n tf.keras.layers.Reshape((1, 1, input_size)),\n deconv(2 * base_depth, 7, padding=\"VALID\"),\n deconv(2 * base_depth, 5),\n deconv(2 * base_depth, 5, 2),\n deconv(base_depth, 5),\n deconv(base_depth, 5, 2),\n deconv(base_depth, 5),\n conv(output_shape[-1], 5, activation=None),\n tf.keras.layers.Reshape(output_shape),\n ])\n\n def decoder(codes):\n \"\"\"Builds a distribution over images given codes.\n\n Args:\n codes: A `Tensor` representing the inputs to be decoded, of shape `[...,\n code_size]`.\n\n Returns:\n decoder_distribution: A multivariate `Bernoulli` distribution.\n \"\"\"\n logits = decoder_net(codes)\n return tfd.Independent(tfd.Bernoulli(logits=logits),\n reinterpreted_batch_ndims=len(output_shape),\n name=\"decoder_distribution\")\n\n return decoder", "response": "Creates the decoder function."} {"SOURCE": "codesearchnet", 
"instruction": "Can you write a function in Python 3 where it\nAdds control dependencies to the commitment loss.", "response": "def add_ema_control_dependencies(vector_quantizer,\n one_hot_assignments,\n codes,\n commitment_loss,\n decay):\n \"\"\"Add control dependencies to the commitment loss to update the codebook.\n\n Args:\n vector_quantizer: An instance of the VectorQuantizer class.\n one_hot_assignments: The one-hot vectors corresponding to the matched\n codebook entry for each code in the batch.\n codes: A `float`-like `Tensor` containing the latent vectors to be compared\n to the codebook.\n commitment_loss: The commitment loss from comparing the encoder outputs to\n their neighboring codebook entries.\n decay: Decay factor for exponential moving average.\n\n Returns:\n commitment_loss: Commitment loss with control dependencies.\n \"\"\"\n # Use an exponential moving average to update the codebook.\n updated_ema_count = moving_averages.assign_moving_average(\n vector_quantizer.ema_count,\n tf.reduce_sum(input_tensor=one_hot_assignments, axis=[0, 1]),\n decay,\n zero_debias=False)\n updated_ema_means = moving_averages.assign_moving_average(\n vector_quantizer.ema_means,\n tf.reduce_sum(\n input_tensor=tf.expand_dims(codes, 2) *\n tf.expand_dims(one_hot_assignments, 3),\n axis=[0, 1]),\n decay,\n zero_debias=False)\n\n # Add small value to avoid dividing by zero.\n perturbed_ema_count = updated_ema_count + 1e-5\n with tf.control_dependencies([commitment_loss]):\n update_means = tf.compat.v1.assign(\n vector_quantizer.codebook,\n updated_ema_means / perturbed_ema_count[..., tf.newaxis])\n with tf.control_dependencies([update_means]):\n return tf.identity(commitment_loss)"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\nsaving a grid of images to a PNG file.", "response": "def save_imgs(x, fname):\n \"\"\"Helper method to save a grid of images to a PNG file.\n\n Args:\n x: A numpy array of shape [n_images, height, width].\n fname: 
The filename to write to (including extension).\n \"\"\"\n n = x.shape[0]\n fig = figure.Figure(figsize=(n, 1), frameon=False)\n canvas = backend_agg.FigureCanvasAgg(fig)\n for i in range(n):\n ax = fig.add_subplot(1, n, i+1)\n ax.imshow(x[i].squeeze(),\n interpolation=\"none\",\n cmap=cm.get_cmap(\"binary\"))\n ax.axis(\"off\")\n canvas.print_figure(fname, format=\"png\")\n print(\"saved %s\" % fname)"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef load_bernoulli_mnist_dataset(directory, split_name):\n amat_file = download(directory, FILE_TEMPLATE.format(split=split_name))\n dataset = tf.data.TextLineDataset(amat_file)\n str_to_arr = lambda string: np.array([c == b\"1\" for c in string.split()])\n\n def _parser(s):\n booltensor = tf.compat.v1.py_func(str_to_arr, [s], tf.bool)\n reshaped = tf.reshape(booltensor, [28, 28, 1])\n return tf.cast(reshaped, dtype=tf.float32), tf.constant(0, tf.int32)\n\n return dataset.map(_parser)", "response": "Returns Hugo Larochelle's binary static MNIST dataset."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function for\nbuilding an iterator that can switch between train and heldout data.", "response": "def build_input_pipeline(data_dir, batch_size, heldout_size, mnist_type):\n \"\"\"Builds an Iterator switching between train and heldout data.\"\"\"\n # Build an iterator over training batches.\n if mnist_type in [MnistType.FAKE_DATA, MnistType.THRESHOLD]:\n if mnist_type == MnistType.FAKE_DATA:\n mnist_data = build_fake_data()\n else:\n mnist_data = mnist.read_data_sets(data_dir)\n training_dataset = tf.data.Dataset.from_tensor_slices(\n (mnist_data.train.images, np.int32(mnist_data.train.labels)))\n heldout_dataset = tf.data.Dataset.from_tensor_slices(\n (mnist_data.validation.images,\n np.int32(mnist_data.validation.labels)))\n elif mnist_type == MnistType.BERNOULLI:\n training_dataset = load_bernoulli_mnist_dataset(data_dir, \"train\")\n 
heldout_dataset = load_bernoulli_mnist_dataset(data_dir, \"valid\")\n else:\n raise ValueError(\"Unknown MNIST type.\")\n\n training_batches = training_dataset.repeat().batch(batch_size)\n training_iterator = tf.compat.v1.data.make_one_shot_iterator(training_batches)\n\n # Build an iterator over the heldout set with batch_size=heldout_size,\n # i.e., return the entire heldout set as a constant.\n heldout_frozen = (heldout_dataset.take(heldout_size).\n repeat().batch(heldout_size))\n heldout_iterator = tf.compat.v1.data.make_one_shot_iterator(heldout_frozen)\n\n # Combine these into a feedable iterator that can switch between training\n # and validation inputs.\n handle = tf.compat.v1.placeholder(tf.string, shape=[])\n feedable_iterator = tf.compat.v1.data.Iterator.from_string_handle(\n handle, training_batches.output_types, training_batches.output_shapes)\n images, labels = feedable_iterator.get_next()\n # Reshape as a pixel image and binarize pixels.\n images = tf.reshape(images, shape=[-1] + IMAGE_SHAPE)\n if mnist_type in [MnistType.FAKE_DATA, MnistType.THRESHOLD]:\n images = tf.cast(images > 0.5, dtype=tf.int32)\n\n return images, labels, handle, training_iterator, heldout_iterator"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\nreturn a np. 
dtype based on this dtype.", "response": "def as_numpy_dtype(dtype):\n \"\"\"Returns a `np.dtype` based on this `dtype`.\"\"\"\n dtype = tf.as_dtype(dtype)\n if hasattr(dtype, 'as_numpy_dtype'):\n return dtype.as_numpy_dtype\n return dtype"} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\nreturns a non - reference dtype based on this dtype.", "response": "def base_dtype(dtype):\n \"\"\"Returns a non-reference `dtype` based on this `dtype`.\"\"\"\n dtype = tf.as_dtype(dtype)\n if hasattr(dtype, 'base_dtype'):\n return dtype.base_dtype\n return dtype"} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\nreturns whether this is a boolean data type.", "response": "def is_bool(dtype):\n \"\"\"Returns whether this is a boolean data type.\"\"\"\n dtype = tf.as_dtype(dtype)\n if hasattr(dtype, 'is_bool'):\n return dtype.is_bool\n # We use `kind` because:\n # np.issubdtype(np.uint8, np.bool) == True.\n return np.dtype(dtype).kind == 'b'"} {"SOURCE": "codesearchnet", "instruction": "Can you create a Python 3 function that\nreturns whether this is a complex floating point type.", "response": "def is_complex(dtype):\n \"\"\"Returns whether this is a complex floating point type.\"\"\"\n dtype = tf.as_dtype(dtype)\n if hasattr(dtype, 'is_complex'):\n return dtype.is_complex\n return np.issubdtype(np.dtype(dtype), np.complex)"} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\nreturn whether this is a non - quantized floating point type.", "response": "def is_floating(dtype):\n \"\"\"Returns whether this is a (non-quantized, real) floating point type.\"\"\"\n dtype = tf.as_dtype(dtype)\n if hasattr(dtype, 'is_floating'):\n return dtype.is_floating\n return np.issubdtype(np.dtype(dtype), np.float)"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef is_integer(dtype):\n dtype = 
tf.as_dtype(dtype)\n if hasattr(dtype, 'is_integer'):\n return dtype.is_integer\n return np.issubdtype(np.dtype(dtype), np.integer)", "response": "Returns whether this is a non - quantized integer type."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef max(dtype): # pylint: disable=redefined-builtin\n dtype = tf.as_dtype(dtype)\n if hasattr(dtype, 'max'):\n return dtype.max\n use_finfo = is_floating(dtype) or is_complex(dtype)\n return np.finfo(dtype).max if use_finfo else np.iinfo(dtype).max", "response": "Returns the maximum representable value in this data type."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef name(dtype):\n dtype = tf.as_dtype(dtype)\n if hasattr(dtype, 'name'):\n return dtype.name\n if hasattr(dtype, '__name__'):\n return dtype.__name__\n return str(dtype)", "response": "Returns the string name for this dtype."} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\nreturns the number of bytes to represent this dtype.", "response": "def size(dtype):\n \"\"\"Returns the number of bytes to represent this `dtype`.\"\"\"\n dtype = tf.as_dtype(dtype)\n if hasattr(dtype, 'size'):\n return dtype.size\n return np.dtype(dtype).itemsize"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _assert_same_base_type(items, expected_type=None):\n original_expected_type = expected_type\n mismatch = False\n for item in items:\n if item is not None:\n item_type = base_dtype(item.dtype)\n if not expected_type:\n expected_type = item_type\n elif expected_type != item_type:\n mismatch = True\n break\n if mismatch:\n # Loop back through and build up an informative error message (this is very\n # slow, so we don't do it unless we found an error above).\n expected_type = original_expected_type\n original_item_str = None\n get_name = lambda x: x.name if hasattr(x, 
'name') else str(x)\n for item in items:\n if item is not None:\n item_type = base_dtype(item.dtype)\n if not expected_type:\n expected_type = item_type\n original_item_str = get_name(item)\n elif expected_type != item_type:\n raise ValueError(\n '{}, type={}, must be of the same type ({}){}.'.format(\n get_name(item),\n item_type,\n expected_type,\n ((' as {}'.format(original_item_str))\n if original_item_str else '')))\n return expected_type # Should be unreachable\n else:\n return expected_type", "response": "r Asserts all items are of the same base type."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef assert_same_float_dtype(tensors=None, dtype=None):\n if tensors:\n dtype = _assert_same_base_type(tensors, dtype)\n if not dtype:\n dtype = tf.float32\n elif not is_floating(dtype):\n raise ValueError('Expected floating point type, got {}.'.format(dtype))\n return dtype", "response": "Validate and return float type based on tensors and dtype."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _kl_categorical_categorical(a, b, name=None):\n with tf.name_scope(name or \"kl_categorical_categorical\"):\n # sum(p ln(p / q))\n return tf.reduce_sum(\n input_tensor=tf.nn.softmax(a.logits) *\n (tf.nn.log_softmax(a.logits) - tf.nn.log_softmax(b.logits)),\n axis=-1)", "response": "Calculate the batched KL divergence KL with a b OneHotCategorical distribution."} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\nminimizing an objective function using the Nelder Mead simplex algorithm.", "response": "def minimize(objective_function,\n initial_simplex=None,\n initial_vertex=None,\n step_sizes=None,\n objective_at_initial_simplex=None,\n objective_at_initial_vertex=None,\n batch_evaluate_objective=False,\n func_tolerance=1e-8,\n position_tolerance=1e-8,\n parallel_iterations=1,\n max_iterations=None,\n 
reflection=None,\n expansion=None,\n contraction=None,\n shrinkage=None,\n name=None):\n \"\"\"Minimum of the objective function using the Nelder Mead simplex algorithm.\n\n Performs an unconstrained minimization of a (possibly non-smooth) function\n using the Nelder Mead simplex method. Nelder Mead method does not support\n univariate functions. Hence the dimensions of the domain must be 2 or greater.\n For details of the algorithm, see\n [Press, Teukolsky, Vetterling and Flannery(2007)][1].\n\n Points in the domain of the objective function may be represented as a\n `Tensor` of general shape but with rank at least 1. The algorithm proceeds\n by modifying a full rank simplex in the domain. The initial simplex may\n either be specified by the user or can be constructed using a single vertex\n supplied by the user. In the latter case, if `v0` is the supplied vertex,\n the simplex is the convex hull of the set:\n\n ```None\n S = {v0} + {v0 + step_i * e_i}\n ```\n\n Here `e_i` is a vector which is `1` along the `i`-th axis and zero elsewhere\n and `step_i` is a characteristic length scale along the `i`-th axis. If the\n step size is not supplied by the user, a unit step size is used in every axis.\n Alternately, a single step size may be specified which is used for every\n axis. 
The most flexible option is to supply a bespoke step size for every\n axis.\n\n ### Usage:\n\n The following example demonstrates the usage of the Nelder Mead minimzation\n on a two dimensional problem with the minimum located at a non-differentiable\n point.\n\n ```python\n # The objective function\n def sqrt_quadratic(x):\n return tf.sqrt(tf.reduce_sum(x ** 2, axis=-1))\n\n start = tf.constant([6.0, -21.0]) # Starting point for the search.\n optim_results = tfp.optimizer.nelder_mead_minimize(\n sqrt_quadratic, initial_vertex=start, func_tolerance=1e-8,\n batch_evaluate_objective=True)\n\n with tf.Session() as session:\n results = session.run(optim_results)\n # Check that the search converged\n assert(results.converged)\n # Check that the argmin is close to the actual value.\n np.testing.assert_allclose(results.position, np.array([0.0, 0.0]),\n atol=1e-7)\n # Print out the total number of function evaluations it took.\n print (\"Function evaluations: %d\" % results.num_objective_evaluations)\n ```\n\n ### References:\n [1]: William Press, Saul Teukolsky, William Vetterling and Brian Flannery.\n Numerical Recipes in C++, third edition. pp. 502-507. (2007).\n http://numerical.recipes/cpppages/chap0sel.pdf\n\n [2]: Jeffrey Lagarias, James Reeds, Margaret Wright and Paul Wright.\n Convergence properties of the Nelder-Mead simplex method in low dimensions,\n Siam J. Optim., Vol 9, No. 1, pp. 112-147. (1998).\n http://www.math.kent.edu/~reichel/courses/Opt/reading.material.2/nelder.mead.pdf\n\n [3]: Fuchang Gao and Lixing Han. Implementing the Nelder-Mead simplex\n algorithm with adaptive parameters. Computational Optimization and\n Applications, Vol 51, Issue 1, pp 259-277. (2012).\n https://pdfs.semanticscholar.org/15b4/c4aa7437df4d032c6ee6ce98d6030dd627be.pdf\n\n Args:\n objective_function: A Python callable that accepts a point as a\n real `Tensor` and returns a `Tensor` of real dtype containing\n the value of the function at that point. 
The function\n to be minimized. If `batch_evaluate_objective` is `True`, the callable\n may be evaluated on a `Tensor` of shape `[n+1] + s ` where `n` is\n the dimension of the problem and `s` is the shape of a single point\n in the domain (so `n` is the size of a `Tensor` representing a\n single point).\n In this case, the expected return value is a `Tensor` of shape `[n+1]`.\n Note that this method does not support univariate functions so the problem\n dimension `n` must be strictly greater than 1.\n initial_simplex: (Optional) `Tensor` of real dtype. The initial simplex to\n start the search. If supplied, should be a `Tensor` of shape `[n+1] + s`\n where `n` is the dimension of the problem and `s` is the shape of a\n single point in the domain. Each row (i.e. the `Tensor` with a given\n value of the first index) is interpreted as a vertex of a simplex and\n hence the rows must be affinely independent. If not supplied, an axes\n aligned simplex is constructed using the `initial_vertex` and\n `step_sizes`. Only one and at least one of `initial_simplex` and\n `initial_vertex` must be supplied.\n initial_vertex: (Optional) `Tensor` of real dtype and any shape that can\n be consumed by the `objective_function`. A single point in the domain that\n will be used to construct an axes aligned initial simplex.\n step_sizes: (Optional) `Tensor` of real dtype and shape broadcasting\n compatible with `initial_vertex`. Supplies the simplex scale along each\n axes. Only used if `initial_simplex` is not supplied. See description\n above for details on how step sizes and initial vertex are used to\n construct the initial simplex.\n objective_at_initial_simplex: (Optional) Rank `1` `Tensor` of real dtype\n of a rank `1` `Tensor`. The value of the objective function at the\n initial simplex. May be supplied only if `initial_simplex` is\n supplied. If not supplied, it will be computed.\n objective_at_initial_vertex: (Optional) Scalar `Tensor` of real dtype. 
The\n value of the objective function at the initial vertex. May be supplied\n only if the `initial_vertex` is also supplied.\n batch_evaluate_objective: (Optional) Python `bool`. If True, the objective\n function will be evaluated on all the vertices of the simplex packed\n into a single tensor. If False, the objective will be mapped across each\n vertex separately. Evaluating the objective function in a batch allows\n use of vectorization and should be preferred if the objective function\n allows it.\n func_tolerance: (Optional) Scalar `Tensor` of real dtype. The algorithm\n stops if the absolute difference between the largest and the smallest\n function value on the vertices of the simplex is below this number.\n position_tolerance: (Optional) Scalar `Tensor` of real dtype. The\n algorithm stops if the largest absolute difference between the\n coordinates of the vertices is below this threshold.\n parallel_iterations: (Optional) Positive integer. The number of iterations\n allowed to run in parallel.\n max_iterations: (Optional) Scalar positive `Tensor` of dtype `int32`.\n The maximum number of iterations allowed. If `None` then no limit is\n applied.\n reflection: (Optional) Positive Scalar `Tensor` of same dtype as\n `initial_vertex`. This parameter controls the scaling of the reflected\n vertex. See, [Press et al(2007)][1] for details. If not specified,\n uses the dimension dependent prescription of [Gao and Han(2012)][3].\n expansion: (Optional) Positive Scalar `Tensor` of same dtype as\n `initial_vertex`. Should be greater than `1` and `reflection`. This\n parameter controls the expanded scaling of a reflected vertex.\n See, [Press et al(2007)][1] for details. If not specified, uses the\n dimension dependent prescription of [Gao and Han(2012)][3].\n contraction: (Optional) Positive scalar `Tensor` of same dtype as\n `initial_vertex`. Must be between `0` and `1`. 
This parameter controls\n the contraction of the reflected vertex when the objective function at\n the reflected point fails to show sufficient decrease.\n See, [Press et al(2007)][1] for more details. If not specified, uses\n the dimension dependent prescription of [Gao and Han(2012][3].\n shrinkage: (Optional) Positive scalar `Tensor` of same dtype as\n `initial_vertex`. Must be between `0` and `1`. This parameter is the scale\n by which the simplex is shrunk around the best point when the other\n steps fail to produce improvements.\n See, [Press et al(2007)][1] for more details. If not specified, uses\n the dimension dependent prescription of [Gao and Han(2012][3].\n name: (Optional) Python str. The name prefixed to the ops created by this\n function. If not supplied, the default name 'minimize' is used.\n\n Returns:\n optimizer_results: A namedtuple containing the following items:\n converged: Scalar boolean tensor indicating whether the minimum was\n found within tolerance.\n num_objective_evaluations: The total number of objective\n evaluations performed.\n position: A `Tensor` containing the last argument value found\n during the search. If the search converged, then\n this value is the argmin of the objective function.\n objective_value: A tensor containing the value of the objective\n function at the `position`. If the search\n converged, then this is the (local) minimum of\n the objective function.\n final_simplex: The last simplex constructed before stopping.\n final_objective_values: The objective function evaluated at the\n vertices of the final simplex.\n initial_simplex: The starting simplex.\n initial_objective_values: The objective function evaluated at the\n vertices of the initial simplex.\n num_iterations: The number of iterations of the main algorithm body.\n\n Raises:\n ValueError: If any of the following conditions hold\n 1. If none or more than one of `initial_simplex` and `initial_vertex` are\n supplied.\n 2. 
If `initial_simplex` and `step_sizes` are both specified.\n \"\"\"\n with tf.compat.v1.name_scope(name, 'minimize', [\n initial_simplex, initial_vertex, step_sizes, objective_at_initial_simplex,\n objective_at_initial_vertex, func_tolerance, position_tolerance\n ]):\n (\n dim,\n _,\n simplex,\n objective_at_simplex,\n num_evaluations\n ) = _prepare_args(objective_function,\n initial_simplex,\n initial_vertex,\n step_sizes,\n objective_at_initial_simplex,\n objective_at_initial_vertex,\n batch_evaluate_objective)\n domain_dtype = simplex.dtype\n (\n reflection,\n expansion,\n contraction,\n shrinkage\n ) = _resolve_parameters(dim,\n reflection,\n expansion,\n contraction,\n shrinkage,\n domain_dtype)\n\n closure_kwargs = dict(\n objective_function=objective_function,\n dim=dim,\n func_tolerance=func_tolerance,\n position_tolerance=position_tolerance,\n batch_evaluate_objective=batch_evaluate_objective,\n reflection=reflection,\n expansion=expansion,\n contraction=contraction,\n shrinkage=shrinkage)\n\n def _loop_body(_, iterations, simplex, objective_at_simplex,\n num_evaluations):\n (\n converged,\n next_simplex,\n next_objective,\n evaluations\n ) = nelder_mead_one_step(simplex, objective_at_simplex, **closure_kwargs)\n\n return (converged, iterations + 1, next_simplex, next_objective,\n num_evaluations + evaluations)\n\n initial_args = (False, 0, simplex, objective_at_simplex,\n num_evaluations)\n # Loop until either we have converged or if the max iterations are supplied\n # then until we have converged or exhausted the available iteration budget.\n def _is_converged(converged, num_iterations, *ignored_args): # pylint:disable=unused-argument\n # It is important to ensure that not_converged is a tensor. 
If\n # converged is not a tensor but a Python bool, then the overloaded\n # op '~' acts as bitwise complement so ~True = -2 and ~False = -1.\n # In that case, the loop will never terminate.\n not_converged = tf.logical_not(converged)\n return (not_converged if max_iterations is None\n else (not_converged & (num_iterations < max_iterations)))\n\n (converged, num_iterations, final_simplex, final_objective_values,\n final_evaluations) = tf.while_loop(\n cond=_is_converged,\n body=_loop_body,\n loop_vars=initial_args,\n parallel_iterations=parallel_iterations)\n order = tf.argsort(\n final_objective_values, direction='ASCENDING', stable=True)\n best_index = order[0]\n # The explicit cast to Tensor below is done to avoid returning a mixture\n # of Python types and Tensors which cause problems with session.run.\n # In the eager mode, converged may remain a Python bool. Trying to evaluate\n # the whole tuple in one evaluate call will raise an exception because\n # of the presence of non-tensors. 
This is very annoying so we explicitly\n # cast those arguments to Tensors.\n return NelderMeadOptimizerResults(\n converged=tf.convert_to_tensor(value=converged),\n num_objective_evaluations=final_evaluations,\n position=final_simplex[best_index],\n objective_value=final_objective_values[best_index],\n final_simplex=final_simplex,\n final_objective_values=final_objective_values,\n num_iterations=tf.convert_to_tensor(value=num_iterations),\n initial_simplex=simplex,\n initial_objective_values=objective_at_simplex)"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _accept_reflected_fn(simplex,\n objective_values,\n worst_index,\n reflected,\n objective_at_reflected):\n \"\"\"Creates the condition function pair for a reflection to be accepted.\"\"\"\n def _replace_worst_with_reflected():\n next_simplex = _replace_at_index(simplex, worst_index, reflected)\n next_objective_values = _replace_at_index(objective_values, worst_index,\n objective_at_reflected)\n return False, next_simplex, next_objective_values, 0\n return _replace_worst_with_reflected", "response": "Creates a condition function pair for a reflection to be accepted."} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef _expansion_fn(objective_function,\n simplex,\n objective_values,\n worst_index,\n reflected,\n objective_at_reflected,\n face_centroid,\n expansion):\n \"\"\"Creates the condition function pair for an expansion.\"\"\"\n def _expand_and_maybe_replace():\n \"\"\"Performs the expansion step.\"\"\"\n expanded = face_centroid + expansion * (reflected - face_centroid)\n expanded_objective_value = objective_function(expanded)\n expanded_is_better = (expanded_objective_value <\n objective_at_reflected)\n accept_expanded_fn = lambda: (expanded, expanded_objective_value)\n accept_reflected_fn = lambda: (reflected, objective_at_reflected)\n next_pt, next_objective_value = 
prefer_static.cond(\n expanded_is_better, accept_expanded_fn, accept_reflected_fn)\n next_simplex = _replace_at_index(simplex, worst_index, next_pt)\n next_objective_at_simplex = _replace_at_index(objective_values,\n worst_index,\n next_objective_value)\n return False, next_simplex, next_objective_at_simplex, 1\n return _expand_and_maybe_replace", "response": "Creates a condition function pair for an expansion."} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\ncreating a condition function pair for an outside contraction.", "response": "def _outside_contraction_fn(objective_function,\n simplex,\n objective_values,\n face_centroid,\n best_index,\n worst_index,\n reflected,\n objective_at_reflected,\n contraction,\n shrinkage,\n batch_evaluate_objective):\n \"\"\"Creates the condition function pair for an outside contraction.\"\"\"\n def _contraction():\n \"\"\"Performs a contraction.\"\"\"\n contracted = face_centroid + contraction * (reflected - face_centroid)\n objective_at_contracted = objective_function(contracted)\n is_contracted_acceptable = objective_at_contracted <= objective_at_reflected\n def _accept_contraction():\n next_simplex = _replace_at_index(simplex, worst_index, contracted)\n objective_at_next_simplex = _replace_at_index(\n objective_values,\n worst_index,\n objective_at_contracted)\n return (False,\n next_simplex,\n objective_at_next_simplex,\n 1)\n\n def _reject_contraction():\n return _shrink_towards_best(objective_function,\n simplex,\n best_index,\n shrinkage,\n batch_evaluate_objective)\n\n return prefer_static.cond(is_contracted_acceptable,\n _accept_contraction,\n _reject_contraction)\n return _contraction"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef _shrink_towards_best(objective_function,\n simplex,\n best_index,\n shrinkage,\n batch_evaluate_objective):\n \"\"\"Shrinks the simplex around the best vertex.\"\"\"\n\n # If the contraction step fails to 
improve the average objective enough,\n # the simplex is shrunk towards the best vertex.\n best_vertex = simplex[best_index]\n shrunk_simplex = best_vertex + shrinkage * (simplex - best_vertex)\n objective_at_shrunk_simplex, evals = _evaluate_objective_multiple(\n objective_function,\n shrunk_simplex,\n batch_evaluate_objective)\n return (False,\n shrunk_simplex,\n objective_at_shrunk_simplex,\n evals)", "response": "Shrinks the simplex around the best vertex."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _replace_at_index(x, index, replacement):\n x_new = tf.concat([x[:index], tf.expand_dims(replacement, axis=0),\n x[(index + 1):]], axis=0)\n return x_new", "response": "Replaces an element at supplied index."} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef _check_convergence(simplex,\n best_vertex,\n best_objective,\n worst_objective,\n func_tolerance,\n position_tolerance):\n \"\"\"Returns True if the simplex has converged.\n\n If the simplex size is smaller than the `position_tolerance` or the variation\n of the function value over the vertices of the simplex is smaller than the\n `func_tolerance` return True else False.\n\n Args:\n simplex: `Tensor` of real dtype. The simplex to test for convergence. For\n more details, see the docstring for `initial_simplex` argument\n of `minimize`.\n best_vertex: `Tensor` of real dtype and rank one less than `simplex`. The\n vertex with the best (i.e. smallest) objective value.\n best_objective: Scalar `Tensor` of real dtype. The best (i.e. smallest)\n value of the objective function at a vertex.\n worst_objective: Scalar `Tensor` of same dtype as `best_objective`. The\n worst (i.e. largest) value of the objective function at a vertex.\n func_tolerance: Scalar positive `Tensor`. The tolerance for the variation\n of the objective function value over the simplex. 
If the variation over\n the simplex vertices is below this threshold, convergence is True.\n position_tolerance: Scalar positive `Tensor`. The algorithm stops if the\n lengths (under the supremum norm) of edges connecting to the best vertex\n are below this threshold.\n\n Returns:\n has_converged: A scalar boolean `Tensor` indicating whether the algorithm\n is deemed to have converged.\n \"\"\"\n objective_convergence = tf.abs(worst_objective -\n best_objective) < func_tolerance\n simplex_degeneracy = tf.reduce_max(\n input_tensor=tf.abs(simplex - best_vertex)) < position_tolerance\n return objective_convergence | simplex_degeneracy", "response": "Checks if the simplex has converged."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _prepare_args(objective_function,\n initial_simplex,\n initial_vertex,\n step_sizes,\n objective_at_initial_simplex,\n objective_at_initial_vertex,\n batch_evaluate_objective):\n \"\"\"Computes the initial simplex and the objective values at the simplex.\n\n Args:\n objective_function: A Python callable that accepts a point as a\n real `Tensor` and returns a `Tensor` of real dtype containing\n the value of the function at that point. The function\n to be evaluated at the simplex. If `batch_evaluate_objective` is `True`,\n the callable may be evaluated on a `Tensor` of shape `[n+1] + s `\n where `n` is the dimension of the problem and `s` is the shape of a\n single point in the domain (so `n` is the size of a `Tensor`\n representing a single point).\n In this case, the expected return value is a `Tensor` of shape `[n+1]`.\n initial_simplex: None or `Tensor` of real dtype. The initial simplex to\n start the search. If supplied, should be a `Tensor` of shape `[n+1] + s`\n where `n` is the dimension of the problem and `s` is the shape of a\n single point in the domain. Each row (i.e. 
the `Tensor` with a given\n value of the first index) is interpreted as a vertex of a simplex and\n hence the rows must be affinely independent. If not supplied, an axes\n aligned simplex is constructed using the `initial_vertex` and\n `step_sizes`. Only one and at least one of `initial_simplex` and\n `initial_vertex` must be supplied.\n initial_vertex: None or `Tensor` of real dtype and any shape that can\n be consumed by the `objective_function`. A single point in the domain that\n will be used to construct an axes aligned initial simplex.\n step_sizes: None or `Tensor` of real dtype and shape broadcasting\n compatible with `initial_vertex`. Supplies the simplex scale along each\n axes. Only used if `initial_simplex` is not supplied. See the docstring\n of `minimize` for more details.\n objective_at_initial_simplex: None or rank `1` `Tensor` of real dtype.\n The value of the objective function at the initial simplex.\n May be supplied only if `initial_simplex` is\n supplied. If not supplied, it will be computed.\n objective_at_initial_vertex: None or scalar `Tensor` of real dtype. The\n value of the objective function at the initial vertex. May be supplied\n only if the `initial_vertex` is also supplied.\n batch_evaluate_objective: Python `bool`. If True, the objective function\n will be evaluated on all the vertices of the simplex packed into a\n single tensor. If False, the objective will be mapped across each\n vertex separately.\n\n Returns:\n prepared_args: A tuple containing the following elements:\n dimension: Scalar `Tensor` of `int32` dtype. The dimension of the problem\n as inferred from the supplied arguments.\n num_vertices: Scalar `Tensor` of `int32` dtype. The number of vertices\n in the simplex.\n simplex: A `Tensor` of same dtype as `initial_simplex`\n (or `initial_vertex`). 
The first component of the shape of the\n `Tensor` is `num_vertices` and each element represents a vertex of\n the simplex.\n objective_at_simplex: A `Tensor` of same dtype as the dtype of the\n return value of objective_function. The shape is a vector of size\n `num_vertices`. The objective function evaluated at the simplex.\n num_evaluations: An `int32` scalar `Tensor`. The number of points on\n which the objective function was evaluated.\n\n Raises:\n ValueError: If any of the following conditions hold\n 1. If none or more than one of `initial_simplex` and `initial_vertex` are\n supplied.\n 2. If `initial_simplex` and `step_sizes` are both specified.\n \"\"\"\n if objective_at_initial_simplex is not None and initial_simplex is None:\n raise ValueError('`objective_at_initial_simplex` specified but the'\n '`initial_simplex` was not.')\n\n if objective_at_initial_vertex is not None and initial_vertex is None:\n raise ValueError('`objective_at_initial_vertex` specified but the'\n '`initial_vertex` was not.')\n\n # The full simplex was specified.\n if initial_simplex is not None:\n if initial_vertex is not None:\n raise ValueError('Both `initial_simplex` and `initial_vertex` specified.'\n ' Only one of the two should be specified.')\n\n if step_sizes is not None:\n raise ValueError('`step_sizes` must not be specified when an'\n ' `initial_simplex` has been specified.')\n return _prepare_args_with_initial_simplex(objective_function,\n initial_simplex,\n objective_at_initial_simplex,\n batch_evaluate_objective)\n\n if initial_vertex is None:\n raise ValueError('One of `initial_simplex` or `initial_vertex`'\n ' must be supplied')\n\n if step_sizes is None:\n step_sizes = _default_step_sizes(initial_vertex)\n\n return _prepare_args_with_initial_vertex(objective_function,\n initial_vertex,\n step_sizes,\n objective_at_initial_vertex,\n batch_evaluate_objective)", "response": "Prepare the arguments for the objective function at the simplex."} {"SOURCE": "codesearchnet", 
"instruction": "Can you tell what is the following Python 3 function doing\ndef _default_step_sizes(reference_vertex):\n # Step size to choose when the coordinate is zero.\n small_sizes = tf.ones_like(reference_vertex) * 0.00025\n # Step size to choose when the coordinate is non-zero.\n large_sizes = reference_vertex * 0.05\n return tf.where(tf.abs(reference_vertex) < _EPSILON,\n small_sizes,\n large_sizes)", "response": "Chooses default step sizes according to [ Gao and Han 2000][ 3 ]."} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\npreparing the args for the objective function at the specified initial simplex.", "response": "def _prepare_args_with_initial_simplex(objective_function,\n initial_simplex,\n objective_at_initial_simplex,\n batch_evaluate_objective):\n \"\"\"Evaluates the objective function at the specified initial simplex.\"\"\"\n initial_simplex = tf.convert_to_tensor(value=initial_simplex)\n\n # If d is the dimension of the problem, the number of vertices in the\n # simplex should be d+1. 
From this, we can infer the number of dimensions\n # as n - 1 where n is the number of vertices specified.\n num_vertices = tf.shape(input=initial_simplex)[0]\n dim = num_vertices - 1\n num_evaluations = 0\n\n if objective_at_initial_simplex is None:\n objective_at_initial_simplex, n_evals = _evaluate_objective_multiple(\n objective_function, initial_simplex, batch_evaluate_objective)\n num_evaluations += n_evals\n objective_at_initial_simplex = tf.convert_to_tensor(\n value=objective_at_initial_simplex)\n return (dim,\n num_vertices,\n initial_simplex,\n objective_at_initial_simplex,\n num_evaluations)"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _prepare_args_with_initial_vertex(objective_function,\n initial_vertex,\n step_sizes,\n objective_at_initial_vertex,\n batch_evaluate_objective):\n \"\"\"Constructs a standard axes aligned simplex.\"\"\"\n dim = tf.size(input=initial_vertex)\n num_vertices = dim + 1\n unit_vectors_along_axes = tf.reshape(\n tf.eye(dim, dim, dtype=initial_vertex.dtype.base_dtype),\n tf.concat([[dim], tf.shape(input=initial_vertex)], axis=0))\n\n # If step_sizes does not broadcast to initial_vertex, the multiplication\n # in the second term will fail.\n simplex_face = initial_vertex + step_sizes * unit_vectors_along_axes\n simplex = tf.concat([tf.expand_dims(initial_vertex, axis=0),\n simplex_face], axis=0)\n num_evaluations = 0\n # Evaluate the objective function at the simplex vertices.\n if objective_at_initial_vertex is None:\n objective_at_initial_vertex = objective_function(initial_vertex)\n num_evaluations += 1\n\n objective_at_simplex_face, num_evals = _evaluate_objective_multiple(\n objective_function, simplex_face, batch_evaluate_objective)\n num_evaluations += num_evals\n\n objective_at_simplex = tf.concat(\n [\n tf.expand_dims(objective_at_initial_vertex, axis=0),\n objective_at_simplex_face\n ], axis=0)\n\n return (dim,\n num_vertices,\n simplex,\n 
objective_at_simplex,\n num_evaluations)", "response": "Constructs a standard axes aligned simplex."} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\napply the Gao and Han presciption to the unspecified parameters.", "response": "def _resolve_parameters(dim,\n reflection,\n expansion,\n contraction,\n shrinkage,\n dtype):\n \"\"\"Applies the [Gao and Han][3] presciption to the unspecified parameters.\"\"\"\n dim = tf.cast(dim, dtype=dtype)\n reflection = 1. if reflection is None else reflection\n expansion = (1. + 2. / dim) if expansion is None else expansion\n contraction = (0.75 - 1. / (2 * dim)) if contraction is None else contraction\n shrinkage = (1. - 1. / dim) if shrinkage is None else shrinkage\n return reflection, expansion, contraction, shrinkage"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef _evaluate_objective_multiple(objective_function, arg_batch,\n batch_evaluate_objective):\n \"\"\"Evaluates the objective function on a batch of points.\n\n If `batch_evaluate_objective` is True, returns\n `objective function(arg_batch)` else it maps the `objective_function`\n across the `arg_batch`.\n\n Args:\n objective_function: A Python callable that accepts a single `Tensor` of\n rank 'R > 1' and any shape 's' and returns a scalar `Tensor` of real dtype\n containing the value of the function at that point. If\n `batch a `Tensor` of shape `[batch_size] + s ` where `batch_size` is the\n size of the batch of args. In this case, the expected return value is a\n `Tensor` of shape `[batch_size]`.\n arg_batch: A `Tensor` of real dtype. The batch of arguments at which to\n evaluate the `objective_function`. If `batch_evaluate_objective` is False,\n `arg_batch` will be unpacked along the zeroth axis and the\n `objective_function` will be applied to each element.\n batch_evaluate_objective: `bool`. 
Whether the `objective_function` can\n evaluate a batch of arguments at once.\n\n Returns:\n A tuple containing:\n objective_values: A `Tensor` of real dtype and shape `[batch_size]`.\n The value of the objective function evaluated at the supplied\n `arg_batch`.\n num_evaluations: An `int32` scalar `Tensor`containing the number of\n points on which the objective function was evaluated (i.e `batch_size`).\n \"\"\"\n n_points = tf.shape(input=arg_batch)[0]\n if batch_evaluate_objective:\n return objective_function(arg_batch), n_points\n return tf.map_fn(objective_function, arg_batch), n_points", "response": "Evaluates the objective function on a batch of points."} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\nsaving a PNG plot with histograms of weight means and stddevs.", "response": "def plot_weight_posteriors(names, qm_vals, qs_vals, fname):\n \"\"\"Save a PNG plot with histograms of weight means and stddevs.\n\n Args:\n names: A Python `iterable` of `str` variable names.\n qm_vals: A Python `iterable`, the same length as `names`,\n whose elements are Numpy `array`s, of any shape, containing\n posterior means of weight varibles.\n qs_vals: A Python `iterable`, the same length as `names`,\n whose elements are Numpy `array`s, of any shape, containing\n posterior standard deviations of weight varibles.\n fname: Python `str` filename to save the plot to.\n \"\"\"\n fig = figure.Figure(figsize=(6, 3))\n canvas = backend_agg.FigureCanvasAgg(fig)\n\n ax = fig.add_subplot(1, 2, 1)\n for n, qm in zip(names, qm_vals):\n sns.distplot(qm.flatten(), ax=ax, label=n)\n ax.set_title(\"weight means\")\n ax.set_xlim([-1.5, 1.5])\n ax.legend()\n\n ax = fig.add_subplot(1, 2, 2)\n for n, qs in zip(names, qs_vals):\n sns.distplot(qs.flatten(), ax=ax)\n ax.set_title(\"weight stddevs\")\n ax.set_xlim([0, 1.])\n\n fig.tight_layout()\n canvas.print_figure(fname, format=\"png\")\n print(\"saved {}\".format(fname))"} {"SOURCE": "codesearchnet", "instruction": 
"Create a Python 3 function for\nsaving a PNG plot visualizing posterior uncertainty on heldout data.", "response": "def plot_heldout_prediction(input_vals, probs,\n fname, n=10, title=\"\"):\n \"\"\"Save a PNG plot visualizing posterior uncertainty on heldout data.\n\n Args:\n input_vals: A `float`-like Numpy `array` of shape\n `[num_heldout] + IMAGE_SHAPE`, containing heldout input images.\n probs: A `float`-like Numpy array of shape `[num_monte_carlo,\n num_heldout, num_classes]` containing Monte Carlo samples of\n class probabilities for each heldout sample.\n fname: Python `str` filename to save the plot to.\n n: Python `int` number of datapoints to visualize.\n title: Python `str` title for the plot.\n \"\"\"\n fig = figure.Figure(figsize=(9, 3*n))\n canvas = backend_agg.FigureCanvasAgg(fig)\n for i in range(n):\n ax = fig.add_subplot(n, 3, 3*i + 1)\n ax.imshow(input_vals[i, :].reshape(IMAGE_SHAPE[:-1]), interpolation=\"None\")\n\n ax = fig.add_subplot(n, 3, 3*i + 2)\n for prob_sample in probs:\n sns.barplot(np.arange(10), prob_sample[i, :], alpha=0.1, ax=ax)\n ax.set_ylim([0, 1])\n ax.set_title(\"posterior samples\")\n\n ax = fig.add_subplot(n, 3, 3*i + 3)\n sns.barplot(np.arange(10), np.mean(probs[:, i, :], axis=0), ax=ax)\n ax.set_ylim([0, 1])\n ax.set_title(\"predictive probs\")\n fig.suptitle(title)\n fig.tight_layout()\n\n canvas.print_figure(fname, format=\"png\")\n print(\"saved {}\".format(fname))"} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\nbuilds an iterator that can switch between train and heldout data.", "response": "def build_input_pipeline(mnist_data, batch_size, heldout_size):\n \"\"\"Build an Iterator switching between train and heldout data.\"\"\"\n\n # Build an iterator over training batches.\n training_dataset = tf.data.Dataset.from_tensor_slices(\n (mnist_data.train.images, np.int32(mnist_data.train.labels)))\n training_batches = training_dataset.shuffle(\n 50000,
reshuffle_each_iteration=True).repeat().batch(batch_size)\n training_iterator = tf.compat.v1.data.make_one_shot_iterator(training_batches)\n\n # Build an iterator over the heldout set with batch_size=heldout_size,\n # i.e., return the entire heldout set as a constant.\n heldout_dataset = tf.data.Dataset.from_tensor_slices(\n (mnist_data.validation.images,\n np.int32(mnist_data.validation.labels)))\n heldout_frozen = (heldout_dataset.take(heldout_size).\n repeat().batch(heldout_size))\n heldout_iterator = tf.compat.v1.data.make_one_shot_iterator(heldout_frozen)\n\n # Combine these into a feedable iterator that can switch between training\n # and validation inputs.\n handle = tf.compat.v1.placeholder(tf.string, shape=[])\n feedable_iterator = tf.compat.v1.data.Iterator.from_string_handle(\n handle, training_batches.output_types, training_batches.output_shapes)\n images, labels = feedable_iterator.get_next()\n\n return images, labels, handle, training_iterator, heldout_iterator"} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\nbuilds fake MNIST-style data for unit testing.", "response": "def build_fake_data(num_examples=10):\n \"\"\"Build fake MNIST-style data for unit testing.\"\"\"\n\n class Dummy(object):\n pass\n\n mnist_data = Dummy()\n mnist_data.train = Dummy()\n mnist_data.train.images = np.float32(np.random.randn(\n num_examples, *IMAGE_SHAPE))\n mnist_data.train.labels = np.int32(np.random.permutation(\n np.arange(num_examples)))\n mnist_data.train.num_examples = num_examples\n mnist_data.validation = Dummy()\n mnist_data.validation.images = np.float32(np.random.randn(\n num_examples, *IMAGE_SHAPE))\n mnist_data.validation.labels = np.int32(np.random.permutation(\n np.arange(num_examples)))\n mnist_data.validation.num_examples = num_examples\n return mnist_data"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _kl_bernoulli_bernoulli(a, b,
name=None):\n with tf.name_scope(name or \"kl_bernoulli_bernoulli\"):\n delta_probs0 = tf.nn.softplus(-b.logits) - tf.nn.softplus(-a.logits)\n delta_probs1 = tf.nn.softplus(b.logits) - tf.nn.softplus(a.logits)\n return (tf.sigmoid(a.logits) * delta_probs0\n + tf.sigmoid(-a.logits) * delta_probs1)", "response": "Calculate the batched KL divergence KL with a and b Bernoulli."} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\nreturns the configuration as a JSON - serializable dict.", "response": "def get_config(self):\n \"\"\"Returns initializer configuration as a JSON-serializable dict.\"\"\"\n return {\n 'initializers': [\n tf.compat.v2.initializers.serialize(\n tf.keras.initializers.get(init))\n for init in self.initializers\n ],\n 'sizes': self.sizes,\n 'validate_args': self.validate_args,\n }"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef from_config(cls, config):\n return cls(**{\n 'initializers': [tf.compat.v2.initializers.deserialize(init)\n for init in config.get('initializers', [])],\n 'sizes': config.get('sizes', []),\n 'validate_args': config.get('validate_args', False),\n })", "response": "Instantiates an initializer from a configuration dictionary."} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\nwraps for scipy. sparse. matrix. 
matmul.", "response": "def _matmul(a, b,\n transpose_a=False, transpose_b=False,\n adjoint_a=False, adjoint_b=False,\n a_is_sparse=False, b_is_sparse=False,\n name=None): # pylint: disable=unused-argument\n \"\"\"Numpy matmul wrapper.\"\"\"\n if a_is_sparse or b_is_sparse:\n raise NotImplementedError('Numpy backend does not support sparse matmul.')\n if transpose_a or adjoint_a:\n a = _matrix_transpose(a, conjugate=adjoint_a)\n if transpose_b or adjoint_b:\n b = _matrix_transpose(b, conjugate=adjoint_b)\n return np.matmul(a, b)"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef assign_moving_mean_variance(\n mean_var, variance_var, value, decay, name=None):\n \"\"\"Compute exponentially weighted moving {mean,variance} of a streaming value.\n\n The `value` updated exponentially weighted moving `mean_var` and\n `variance_var` are given by the following recurrence relations:\n\n ```python\n variance_var = decay * (variance_var + (1 - decay) * (value - mean_var)**2)\n mean_var = decay * mean_var + (1 - decay) * value\n ```\n\n Note: `mean_var` is updated *after* `variance_var`, i.e., `variance_var` uses\n the lag-1 mean.\n\n For derivation justification, see [Finch (2009; Eq. 143)][1].\n Parameterization: Finch's `alpha` is `1 - decay`.\n\n Args:\n mean_var: `float`-like `Variable` representing the exponentially weighted\n moving mean. Same shape as `variance_var` and `value`.\n variance_var: `float`-like `Variable` representing the\n exponentially weighted moving variance. Same shape as `mean_var` and\n `value`.\n value: `float`-like `Tensor`. Same shape as `mean_var` and `variance_var`.\n decay: A `float`-like `Tensor`. The moving mean decay. 
Typically close to\n `1.`, e.g., `0.999`.\n name: Optional name of the returned operation.\n\n Returns:\n mean_var: `Variable` representing the `value`-updated exponentially weighted\n moving mean.\n variance_var: `Variable` representing the `value`-updated\n exponentially weighted moving variance.\n\n Raises:\n TypeError: if `mean_var` does not have float type `dtype`.\n TypeError: if `mean_var`, `variance_var`, `value`, `decay` have different\n `base_dtype`.\n\n #### References\n\n [1]: Tony Finch. Incremental calculation of weighted mean and variance.\n _Technical Report_, 2009.\n http://people.ds.cam.ac.uk/fanf2/hermes/doc/antiforgery/stats.pdf\n \"\"\"\n with tf.compat.v1.name_scope(name, \"assign_moving_mean_variance\",\n [variance_var, mean_var, value, decay]):\n with tf.compat.v1.colocate_with(variance_var):\n with tf.compat.v1.colocate_with(mean_var):\n base_dtype = mean_var.dtype.base_dtype\n if not base_dtype.is_floating:\n raise TypeError(\n \"mean_var.base_dtype({}) does not have float type \"\n \"`dtype`.\".format(base_dtype.name))\n if base_dtype != variance_var.dtype.base_dtype:\n raise TypeError(\n \"mean_var.base_dtype({}) != variance_var.base_dtype({})\".format(\n base_dtype.name,\n variance_var.dtype.base_dtype.name))\n value = tf.convert_to_tensor(\n value=value, dtype=base_dtype, name=\"value\")\n decay = tf.convert_to_tensor(\n value=decay, dtype=base_dtype, name=\"decay\")\n delta = value - mean_var\n with tf.control_dependencies([delta]):\n # We want mean_{t+1} = decay * mean_t + (1. - decay) * value\n # We compute mean += decay * mean_t - mean_t + (1. - decay) * value =\n # = (1. - decay) * (value - mean_t)\n mean_var = mean_var.assign_add((1. 
- decay) * delta)\n # We want variance_{t+1} = decay * (variance_t +\n # + (1 - decay) * (value - mean_var)**2).\n # We compute variance -= variance_t - decay * (variance_t +\n # + (1 - decay) * (value - mean_var)**2) =\n # = (1 - decay) * variance_t\n # - decay * (1 - decay) * (value - mean_var)**2\n # = (1 - decay) * (variance_t - decay * (value - mean_var)**2).\n variance_var = variance_var.assign_sub(\n (1. - decay) * (variance_var - decay * tf.square(delta)))\n return mean_var, variance_var", "response": "Assign a new value to the exponentially weighted moving mean and variance."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef assign_log_moving_mean_exp(\n log_mean_exp_var, log_value, decay, name=None):\n \"\"\"Compute the log of the exponentially weighted moving mean of the exp.\n\n If `log_value` is a draw from a stationary random variable, this function\n approximates `log(E[exp(log_value)])`, i.e., a weighted log-sum-exp. More\n precisely, a `tf.Variable`, `log_mean_exp_var`, is updated by `log_value`\n using the following identity:\n\n ```none\n log_mean_exp_var =\n = log(decay exp(log_mean_exp_var) + (1 - decay) exp(log_value))\n = log(exp(log_mean_exp_var + log(decay)) + exp(log_value + log1p(-decay)))\n = log_mean_exp_var\n + log( exp(log_mean_exp_var - log_mean_exp_var + log(decay))\n + exp(log_value - log_mean_exp_var + log1p(-decay)))\n = log_mean_exp_var\n + log_sum_exp([log(decay), log_value - log_mean_exp_var + log1p(-decay)]).\n ```\n\n In addition to numerical stability, this formulation is advantageous because\n `log_mean_exp_var` can be updated in a lock-free manner, i.e., using\n `assign_add`. (Note: the updates are not thread-safe; it's just that the\n update to the tf.Variable is presumed efficient due to being lock-free.)\n\n Args:\n log_mean_exp_var: `float`-like `Variable` representing the log of the\n exponentially weighted moving mean of the exp. 
Same shape as `log_value`.\n log_value: `float`-like `Tensor` representing a new (streaming) observation.\n Same shape as `log_mean_exp_var`.\n decay: A `float`-like `Tensor`. The moving mean decay. Typically close to\n `1.`, e.g., `0.999`.\n name: Optional name of the returned operation.\n\n Returns:\n log_mean_exp_var: A reference to the input 'Variable' tensor with the\n `log_value`-updated log of the exponentially weighted moving mean of exp.\n\n Raises:\n TypeError: if `log_mean_exp_var` does not have float type `dtype`.\n TypeError: if `log_mean_exp_var`, `log_value`, `decay` have different\n `base_dtype`.\n \"\"\"\n with tf.compat.v1.name_scope(name, \"assign_log_moving_mean_exp\",\n [log_mean_exp_var, log_value, decay]):\n # We want to update the variable in a numerically stable and lock-free way.\n # To do this, observe that variable `x` updated by `v` is:\n # x = log(w exp(x) + (1-w) exp(v))\n # = log(exp(x + log(w)) + exp(v + log1p(-w)))\n # = x + log(exp(x - x + log(w)) + exp(v - x + log1p(-w)))\n # = x + lse([log(w), v - x + log1p(-w)])\n with tf.compat.v1.colocate_with(log_mean_exp_var):\n base_dtype = log_mean_exp_var.dtype.base_dtype\n if not base_dtype.is_floating:\n raise TypeError(\n \"log_mean_exp_var.base_dtype({}) does not have float type \"\n \"`dtype`.\".format(base_dtype.name))\n log_value = tf.convert_to_tensor(\n value=log_value, dtype=base_dtype, name=\"log_value\")\n decay = tf.convert_to_tensor(value=decay, dtype=base_dtype, name=\"decay\")\n delta = (log_value - log_mean_exp_var)[tf.newaxis, ...]\n x = tf.concat([\n tf.math.log(decay) * tf.ones_like(delta),\n delta + tf.math.log1p(-decay)\n ],\n axis=0)\n x = tf.reduce_logsumexp(input_tensor=x, axis=0)\n return log_mean_exp_var.assign_add(x)", "response": "Assign a log of the exponentially weighted moving mean of the exp."} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\ncomputes exponentially weighted moving mean and variance.", 
"response": "def moving_mean_variance(value, decay, name=None):\n \"\"\"Compute exponentially weighted moving {mean,variance} of a streaming value.\n\n The exponentially-weighted moving `mean_var` and `variance_var` are updated\n by `value` according to the following recurrence:\n\n ```python\n variance_var = decay * (variance_var + (1-decay) * (value - mean_var)**2)\n mean_var = decay * mean_var + (1 - decay) * value\n ```\n\n Note: `mean_var` is updated *after* `variance_var`, i.e., `variance_var` uses\n the lag-`1` mean.\n\n For derivation justification, see [Finch (2009; Eq. 143)][1].\n\n Unlike `assign_moving_mean_variance`, this function handles\n variable creation.\n\n Args:\n value: `float`-like `Tensor`. Same shape as `mean_var` and `variance_var`.\n decay: A `float`-like `Tensor`. The moving mean decay. Typically close to\n `1.`, e.g., `0.999`.\n name: Optional name of the returned operation.\n\n Returns:\n mean_var: `Variable` representing the `value`-updated exponentially weighted\n moving mean.\n variance_var: `Variable` representing the `value`-updated\n exponentially weighted moving variance.\n\n Raises:\n TypeError: if `value` does not have float type `dtype`.\n TypeError: if `value`, `decay` have different `base_dtype`.\n\n #### References\n\n [1]: Tony Finch.
Incremental calculation of weighted mean and variance.\n _Technical Report_, 2009.\n http://people.ds.cam.ac.uk/fanf2/hermes/doc/antiforgery/stats.pdf\n \"\"\"\n with tf.compat.v1.variable_scope(name, \"moving_mean_variance\",\n [value, decay]):\n value = tf.convert_to_tensor(value=value, name=\"value\")\n base_dtype = value.dtype.base_dtype\n if not base_dtype.is_floating:\n raise TypeError(\n \"value.base_dtype({}) does not have float type `dtype`.\".format(\n base_dtype.name))\n decay = tf.convert_to_tensor(value=decay, dtype=base_dtype, name=\"decay\")\n variance_var = tf.compat.v2.Variable(\n name=\"moving_variance\",\n initial_value=tf.zeros(shape=value.shape, dtype=value.dtype),\n trainable=False)\n mean_var = tf.compat.v2.Variable(\n name=\"moving_mean\",\n initial_value=tf.zeros(shape=value.shape, dtype=value.dtype),\n trainable=False)\n return assign_moving_mean_variance(\n mean_var, variance_var, value, decay)"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\nensuring non - scalar input has at least one column.", "response": "def _make_columnar(self, x):\n \"\"\"Ensures non-scalar input has at least one column.\n\n Example:\n If `x = [1, 2, 3]` then the output is `[[1], [2], [3]]`.\n\n If `x = [[1, 2, 3], [4, 5, 6]]` then the output is unchanged.\n\n If `x = 1` then the output is unchanged.\n\n Args:\n x: `Tensor`.\n\n Returns:\n columnar_x: `Tensor` with at least two dimensions.\n \"\"\"\n if tensorshape_util.rank(x.shape) is not None:\n if tensorshape_util.rank(x.shape) == 1:\n x = x[tf.newaxis, :]\n return x\n shape = tf.shape(input=x)\n maybe_expanded_shape = tf.concat([\n shape[:-1],\n distribution_util.pick_vector(\n tf.equal(tf.rank(x), 1), [1], np.array([], dtype=np.int32)),\n shape[-1:],\n ], 0)\n return tf.reshape(x, maybe_expanded_shape)"} {"SOURCE": "codesearchnet", "instruction": "Can you create a Python 3 function that\ncalculates the batched KL divergence KL with a and b Laplace.", "response": "def 
_kl_laplace_laplace(a, b, name=None):\n \"\"\"Calculate the batched KL divergence KL(a || b) with a and b Laplace.\n\n Args:\n a: instance of a Laplace distribution object.\n b: instance of a Laplace distribution object.\n name: (optional) Name to use for created operations.\n default is \"kl_laplace_laplace\".\n\n Returns:\n Batchwise KL(a || b)\n \"\"\"\n with tf.name_scope(name or \"kl_laplace_laplace\"):\n # Consistent with\n # http://www.mast.queensu.ca/~communications/Papers/gil-msc11.pdf, page 38\n distance = tf.abs(a.loc - b.loc)\n ratio = a.scale / b.scale\n\n return (-tf.math.log(ratio) - 1 + distance / b.scale +\n ratio * tf.exp(-distance / a.scale))"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef random_rademacher(shape, dtype=tf.float32, seed=None, name=None):\n with tf.compat.v1.name_scope(name, 'random_rademacher', [shape, seed]):\n # Choose the dtype to cause `2 * random_bernoulli - 1` to run in the same\n # memory (host or device) as the downstream cast will want to put it. 
The\n # convention on GPU is that int32 are in host memory and int64 are in device\n # memory.\n generation_dtype = tf.int64 if tf.as_dtype(dtype) != tf.int32 else tf.int32\n random_bernoulli = tf.random.uniform(\n shape, minval=0, maxval=2, dtype=generation_dtype, seed=seed)\n return tf.cast(2 * random_bernoulli - 1, dtype)", "response": "Generates a random tensor from the specified shape and dtype."} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\ngenerating a Tensor of random positive reals drawn from a Rayleigh distribution with specified shape and dtype.", "response": "def random_rayleigh(shape, scale=None, dtype=tf.float32, seed=None, name=None):\n \"\"\"Generates `Tensor` of positive reals drawn from a Rayleigh distribution.\n\n The probability density function of a Rayleigh distribution with `scale`\n parameter is given by:\n\n ```none\n f(x) = x scale**-2 exp(-x**2 0.5 scale**-2)\n ```\n\n For more details, see [Rayleigh distribution](\n https://en.wikipedia.org/wiki/Rayleigh_distribution)\n\n Args:\n shape: Vector-shaped, `int` `Tensor` representing shape of output.\n scale: (Optional) Positive `float` `Tensor` representing `Rayleigh` scale.\n Default value: `None` (i.e., `scale = 1.`).\n dtype: (Optional) TF `dtype` representing `dtype` of output.\n Default value: `tf.float32`.\n seed: (Optional) Python integer to seed the random number generator.\n Default value: `None` (i.e., no seed).\n name: Python `str` name prefixed to Ops created by this function.\n Default value: `None` (i.e., 'random_rayleigh').\n\n Returns:\n rayleigh: `Tensor` with specified `shape` and `dtype` consisting of positive\n real values drawn from a Rayleigh distribution with specified `scale`.\n \"\"\"\n with tf.compat.v1.name_scope(name, 'random_rayleigh', [shape, scale, seed]):\n if scale is not None:\n # It's important to expand the shape to match scale's, otherwise we won't\n # have independent draws.\n scale = tf.convert_to_tensor(value=scale,
dtype=dtype, name='scale')\n shape = tf.broadcast_dynamic_shape(shape, tf.shape(input=scale))\n x = tf.sqrt(-2. * tf.math.log(\n tf.random.uniform(shape, minval=0, maxval=1, dtype=dtype, seed=seed)))\n if scale is None:\n return x\n return x * scale"} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\nfinishing computation of log_prob on one element of the inverse image.", "response": "def _finish_log_prob_for_one_fiber(self, y, x, ildj, event_ndims,\n **distribution_kwargs):\n \"\"\"Finish computation of log_prob on one element of the inverse image.\"\"\"\n x = self._maybe_rotate_dims(x, rotate_right=True)\n log_prob = self.distribution.log_prob(x, **distribution_kwargs)\n if self._is_maybe_event_override:\n log_prob = tf.reduce_sum(\n input_tensor=log_prob, axis=self._reduce_event_indices)\n log_prob += tf.cast(ildj, log_prob.dtype)\n if self._is_maybe_event_override and isinstance(event_ndims, int):\n tensorshape_util.set_shape(\n log_prob,\n tf.broadcast_static_shape(\n tensorshape_util.with_rank_at_least(y.shape, 1)[:-event_ndims],\n self.batch_shape))\n return log_prob"} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\nfinish computation of prob on one element of the inverse image.", "response": "def _finish_prob_for_one_fiber(self, y, x, ildj, event_ndims,\n **distribution_kwargs):\n \"\"\"Finish computation of prob on one element of the inverse image.\"\"\"\n x = self._maybe_rotate_dims(x, rotate_right=True)\n prob = self.distribution.prob(x, **distribution_kwargs)\n if self._is_maybe_event_override:\n prob = tf.reduce_prod(input_tensor=prob, axis=self._reduce_event_indices)\n prob *= tf.exp(tf.cast(ildj, prob.dtype))\n if self._is_maybe_event_override and isinstance(event_ndims, int):\n tensorshape_util.set_shape(\n prob,\n tf.broadcast_static_shape(\n tensorshape_util.with_rank_at_least(y.shape, 1)[:-event_ndims],\n self.batch_shape))\n return prob"} {"SOURCE": "codesearchnet", "instruction": 
"How would you explain what the following Python 3 function does\ndef _maybe_validate_shape_override(self, override_shape, base_is_scalar,\n validate_args, name):\n \"\"\"Helper to __init__ which ensures override batch/event_shape are valid.\"\"\"\n if override_shape is None:\n override_shape = []\n\n override_shape = tf.convert_to_tensor(\n value=override_shape, dtype=tf.int32, name=name)\n\n if not dtype_util.is_integer(override_shape.dtype):\n raise TypeError(\"shape override must be an integer\")\n\n override_is_scalar = _is_scalar_from_shape_tensor(override_shape)\n if tf.get_static_value(override_is_scalar):\n return self._empty\n\n dynamic_assertions = []\n\n if tensorshape_util.rank(override_shape.shape) is not None:\n if tensorshape_util.rank(override_shape.shape) != 1:\n raise ValueError(\"shape override must be a vector\")\n elif validate_args:\n dynamic_assertions += [\n assert_util.assert_rank(\n override_shape, 1, message=\"shape override must be a vector\")\n ]\n\n if tf.get_static_value(override_shape) is not None:\n if any(s < 0 for s in tf.get_static_value(override_shape)):\n raise ValueError(\"shape override must have non-negative elements\")\n elif validate_args:\n dynamic_assertions += [\n assert_util.assert_non_negative(\n override_shape,\n message=\"shape override must have non-negative elements\")\n ]\n\n is_both_nonscalar = prefer_static.logical_and(\n prefer_static.logical_not(base_is_scalar),\n prefer_static.logical_not(override_is_scalar))\n if tf.get_static_value(is_both_nonscalar) is not None:\n if tf.get_static_value(is_both_nonscalar):\n raise ValueError(\"base distribution not scalar\")\n elif validate_args:\n dynamic_assertions += [\n assert_util.assert_equal(\n is_both_nonscalar, False, message=\"base distribution not scalar\")\n ]\n\n if not dynamic_assertions:\n return override_shape\n return distribution_util.with_dependencies(\n dynamic_assertions, override_shape)", "response": "Helper to __init__ which ensures override batch 
or event_shape are valid."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _undo_batch_normalization(x,\n mean,\n variance,\n offset,\n scale,\n variance_epsilon,\n name=None):\n r\"\"\"Inverse of tf.nn.batch_normalization.\n\n Args:\n x: Input `Tensor` of arbitrary dimensionality.\n mean: A mean `Tensor`.\n variance: A variance `Tensor`.\n offset: An offset `Tensor`, often denoted `beta` in equations, or\n None. If present, will be added to the normalized tensor.\n scale: A scale `Tensor`, often denoted `gamma` in equations, or\n `None`. If present, the scale is applied to the normalized tensor.\n variance_epsilon: A small `float` added to the minibatch `variance` to\n prevent dividing by zero.\n name: A name for this operation (optional).\n\n Returns:\n batch_unnormalized: The de-normalized, de-scaled, de-offset `Tensor`.\n \"\"\"\n with tf.compat.v2.name_scope(name or \"undo_batchnorm\"):\n # inv = tf.rsqrt(variance + variance_epsilon)\n # if scale is not None:\n # inv *= scale\n # return x * inv + (\n # offset - mean * inv if offset is not None else -mean * inv)\n rescale = tf.sqrt(variance + variance_epsilon)\n if scale is not None:\n rescale /= scale\n batch_unnormalized = x * rescale + (\n mean - offset * rescale if offset is not None else mean)\n return batch_unnormalized", "response": "r Inverse of tf. nn. 
batch_normalization."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef _validate_bn_layer(self, layer):\n if (not isinstance(layer, tf.keras.layers.BatchNormalization) and\n not isinstance(layer, tf.compat.v1.layers.BatchNormalization)):\n raise ValueError(\n \"batchnorm_layer must be an instance of BatchNormalization layer.\")\n if layer.renorm:\n raise ValueError(\"BatchNorm Bijector does not support renormalization.\")\n if layer.virtual_batch_size:\n raise ValueError(\n \"BatchNorm Bijector does not support virtual batch sizes.\")", "response": "Check for valid BatchNormalization layer."} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\nslice a single parameter of a distribution.", "response": "def _slice_single_param(param, param_event_ndims, slices, dist_batch_shape):\n \"\"\"Slices a single parameter of a distribution.\n\n Args:\n param: A `Tensor`, the original parameter to slice.\n param_event_ndims: `int` event parameterization rank for this parameter.\n slices: A `tuple` of normalized slices.\n dist_batch_shape: The distribution's batch shape `Tensor`.\n\n Returns:\n new_param: A `Tensor`, batch-sliced according to slices.\n \"\"\"\n # Extend param shape with ones on the left to match dist_batch_shape.\n param_shape = tf.shape(input=param)\n insert_ones = tf.ones(\n [tf.size(input=dist_batch_shape) + param_event_ndims - tf.rank(param)],\n dtype=param_shape.dtype)\n new_param_shape = tf.concat([insert_ones, param_shape], axis=0)\n full_batch_param = tf.reshape(param, new_param_shape)\n param_slices = []\n # We separately track the batch axis from the parameter axis because we want\n # them to align for positive indexing, and be offset by param_event_ndims for\n # negative indexing.\n param_dim_idx = 0\n batch_dim_idx = 0\n for slc in slices:\n if slc is tf.newaxis:\n param_slices.append(slc)\n continue\n if slc is Ellipsis:\n if batch_dim_idx < 0:\n 
raise ValueError('Found multiple `...` in slices {}'.format(slices))\n param_slices.append(slc)\n # Switch over to negative indexing for the broadcast check.\n num_remaining_non_newaxis_slices = sum(\n [s is not tf.newaxis for s in slices[slices.index(Ellipsis) + 1:]])\n batch_dim_idx = -num_remaining_non_newaxis_slices\n param_dim_idx = batch_dim_idx - param_event_ndims\n continue\n # Find the batch dimension sizes for both parameter and distribution.\n param_dim_size = new_param_shape[param_dim_idx]\n batch_dim_size = dist_batch_shape[batch_dim_idx]\n is_broadcast = batch_dim_size > param_dim_size\n # Slices are denoted by start:stop:step.\n if isinstance(slc, slice):\n start, stop, step = slc.start, slc.stop, slc.step\n if start is not None:\n start = tf.where(is_broadcast, 0, start)\n if stop is not None:\n stop = tf.where(is_broadcast, 1, stop)\n if step is not None:\n step = tf.where(is_broadcast, 1, step)\n param_slices.append(slice(start, stop, step))\n else: # int, or int Tensor, e.g. 
d[d.batch_shape_tensor()[0] // 2]\n param_slices.append(tf.where(is_broadcast, 0, slc))\n param_dim_idx += 1\n batch_dim_idx += 1\n param_slices.extend([ALL_SLICE] * param_event_ndims)\n return full_batch_param.__getitem__(param_slices)"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _slice_params_to_dict(dist, params_event_ndims, slices):\n override_dict = {}\n for param_name, param_event_ndims in six.iteritems(params_event_ndims):\n # Verify that either None or a legit value is in the parameters dict.\n if param_name not in dist.parameters:\n raise ValueError('Distribution {} is missing advertised '\n 'parameter {}'.format(dist, param_name))\n param = dist.parameters[param_name]\n if param is None:\n # some distributions have multiple possible parameterizations; this\n # param was not provided\n continue\n dtype = None\n if hasattr(dist, param_name):\n attr = getattr(dist, param_name)\n dtype = getattr(attr, 'dtype', None)\n if dtype is None:\n dtype = dist.dtype\n warnings.warn('Unable to find property getter for parameter Tensor {} '\n 'on {}, falling back to Distribution.dtype {}'.format(\n param_name, dist, dtype))\n param = tf.convert_to_tensor(value=param, dtype=dtype)\n override_dict[param_name] = _slice_single_param(param, param_event_ndims,\n slices,\n dist.batch_shape_tensor())\n return override_dict", "response": "Slice the parameters of a given distribution into a dictionary of batch - sliced parameters."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _apply_single_step(dist, params_event_ndims, slices, params_overrides):\n if len(slices) == 1 and slices[0] == Ellipsis:\n # The path used by Distribution.copy: batch_slice(...args..., Ellipsis)\n override_dict = {}\n else:\n override_dict = _slice_params_to_dict(dist, params_event_ndims, slices)\n override_dict.update(params_overrides)\n parameters = dict(dist.parameters, 
**override_dict)\n new_dist = type(dist)(**parameters)\n return new_dist", "response": "Applies a single slicing step to dist returning a new instance."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _apply_slice_sequence(dist, params_event_ndims, slice_overrides_seq):\n for slices, overrides in slice_overrides_seq:\n dist = _apply_single_step(dist, params_event_ndims, slices, overrides)\n return dist", "response": "Applies a sequence of slice or copy - with - overrides operations to dist."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function for\nslicing dist along its batch dimensions.", "response": "def batch_slice(dist, params_event_ndims, params_overrides, slices):\n \"\"\"Slices `dist` along its batch dimensions. Helper for tfd.Distribution.\n\n Args:\n dist: A `tfd.Distribution` instance.\n params_event_ndims: A `dict` of `str->int` indicating the number of\n dimensions of a given parameter required to parameterize a single event.\n params_overrides: A `dict` of parameter overrides. (e.g. from\n `Distribution.copy`).\n slices: A `slice` or `int` or `int` `Tensor` or `tf.newaxis` or `tuple`\n thereof. (e.g. 
the argument of a `__getitem__` method).\n\n Returns:\n new_dist: A batch-sliced `tfd.Distribution`.\n \"\"\"\n if not isinstance(slices, collections.Sequence):\n slices = (slices,)\n # We track the history of slice and copy(**param_overrides) in order to trace\n # back to the original distribution's source variables.\n orig_dist, slice_overrides_seq = getattr(dist, PROVENANCE_ATTR, (dist, []))\n slice_overrides_seq += [(slices, params_overrides)]\n # Re-doing the full sequence of slice+copy override work here enables\n # gradients all the way back to the original distribution's arguments.\n dist = _apply_slice_sequence(orig_dist, params_event_ndims,\n slice_overrides_seq)\n setattr(dist, PROVENANCE_ATTR, (orig_dist, slice_overrides_seq))\n return dist"} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\nfits multiple Fisher scoring steps.", "response": "def fit(\n model_matrix,\n response,\n model,\n model_coefficients_start=None,\n predicted_linear_response_start=None,\n l2_regularizer=None,\n dispersion=None,\n offset=None,\n convergence_criteria_fn=None,\n learning_rate=None,\n fast_unsafe_numerics=True,\n maximum_iterations=None,\n name=None):\n \"\"\"Runs multiple Fisher scoring steps.\n\n Args:\n model_matrix: (Batch of) `float`-like, matrix-shaped `Tensor` where each row\n represents a sample's features.\n response: (Batch of) vector-shaped `Tensor` where each element represents a\n sample's observed response (to the corresponding row of features). Must\n have same `dtype` as `model_matrix`.\n model: `tfp.glm.ExponentialFamily`-like instance which implicitly\n characterizes a negative log-likelihood loss by specifying the\n distribuion's `mean`, `gradient_mean`, and `variance`.\n model_coefficients_start: Optional (batch of) vector-shaped `Tensor`\n representing the initial model coefficients, one for each column in\n `model_matrix`. 
Must have same `dtype` as `model_matrix`.\n Default value: Zeros.\n predicted_linear_response_start: Optional `Tensor` with `shape`, `dtype`\n matching `response`; represents `offset` shifted initial linear\n predictions based on `model_coefficients_start`.\n Default value: `offset` if `model_coefficients is None`, and\n `tf.linalg.matvec(model_matrix, model_coefficients_start) + offset`\n otherwise.\n l2_regularizer: Optional scalar `Tensor` representing L2 regularization\n penalty, i.e.,\n `loss(w) = sum{-log p(y[i]|x[i],w) : i=1..n} + l2_regularizer ||w||_2^2`.\n Default value: `None` (i.e., no L2 regularization).\n dispersion: Optional (batch of) `Tensor` representing `response` dispersion,\n i.e., as in, `p(y|theta) := exp((y theta - A(theta)) / dispersion)`.\n Must broadcast with rows of `model_matrix`.\n Default value: `None` (i.e., \"no dispersion\").\n offset: Optional `Tensor` representing constant shift applied to\n `predicted_linear_response`. Must broadcast to `response`.\n Default value: `None` (i.e., `tf.zeros_like(response)`).\n convergence_criteria_fn: Python `callable` taking:\n `is_converged_previous`, `iter_`, `model_coefficients_previous`,\n `predicted_linear_response_previous`, `model_coefficients_next`,\n `predicted_linear_response_next`, `response`, `model`, `dispersion` and\n returning a `bool` `Tensor` indicating that Fisher scoring has converged.\n See `convergence_criteria_small_relative_norm_weights_change` as an\n example function.\n Default value: `None` (i.e.,\n `convergence_criteria_small_relative_norm_weights_change`).\n learning_rate: Optional (batch of) scalar `Tensor` used to dampen iterative\n progress. 
Typically only needed if optimization diverges, should be no\n larger than `1` and typically very close to `1`.\n Default value: `None` (i.e., `1`).\n fast_unsafe_numerics: Optional Python `bool` indicating if faster, less\n numerically accurate methods can be employed for computing the weighted\n least-squares solution.\n Default value: `True` (i.e., \"fast but possibly diminished accuracy\").\n maximum_iterations: Optional maximum number of iterations of Fisher scoring\n to run; \"and-ed\" with result of `convergence_criteria_fn`.\n Default value: `None` (i.e., `infinity`).\n name: Python `str` used as name prefix to ops created by this function.\n Default value: `\"fit\"`.\n\n Returns:\n model_coefficients: (Batch of) vector-shaped `Tensor`; represents the\n fitted model coefficients, one for each column in `model_matrix`.\n predicted_linear_response: `response`-shaped `Tensor` representing linear\n predictions based on new `model_coefficients`, i.e.,\n `tf.linalg.matvec(model_matrix, model_coefficients) + offset`.\n is_converged: `bool` `Tensor` indicating that the returned\n `model_coefficients` met the `convergence_criteria_fn` criteria within the\n `maximum_iterations` limit.\n iter_: `int32` `Tensor` indicating the number of iterations taken.\n\n #### Example\n\n ```python\n from __future__ import print_function\n import numpy as np\n import tensorflow as tf\n import tensorflow_probability as tfp\n tfd = tfp.distributions\n\n def make_dataset(n, d, link, scale=1., dtype=np.float32):\n model_coefficients = tfd.Uniform(\n low=np.array(-1, dtype),\n high=np.array(1, dtype)).sample(d, seed=42)\n radius = np.sqrt(2.)\n model_coefficients *= radius / tf.linalg.norm(model_coefficients)\n model_matrix = tfd.Normal(\n loc=np.array(0, dtype),\n scale=np.array(1, dtype)).sample([n, d], seed=43)\n scale = tf.convert_to_tensor(scale, dtype)\n linear_response = tf.tensordot(\n model_matrix, model_coefficients, axes=[[1], [0]])\n if link == 'linear':\n response = 
tfd.Normal(loc=linear_response, scale=scale).sample(seed=44)\n elif link == 'probit':\n response = tf.cast(\n tfd.Normal(loc=linear_response, scale=scale).sample(seed=44) > 0,\n dtype)\n elif link == 'logit':\n response = tfd.Bernoulli(logits=linear_response).sample(seed=44)\n else:\n raise ValueError('unrecognized true link: {}'.format(link))\n return model_matrix, response, model_coefficients\n\n X, Y, w_true = make_dataset(n=int(1e6), d=100, link='probit')\n\n w, linear_response, is_converged, num_iter = tfp.glm.fit(\n model_matrix=X,\n response=Y,\n model=tfp.glm.BernoulliNormalCDF())\n log_likelihood = tfp.glm.BernoulliNormalCDF().log_prob(Y, linear_response)\n\n with tf.Session() as sess:\n [w_, linear_response_, is_converged_, num_iter_, Y_, w_true_,\n log_likelihood_] = sess.run([\n w, linear_response, is_converged, num_iter, Y, w_true,\n log_likelihood])\n\n print('is_converged: ', is_converged_)\n print(' num_iter: ', num_iter_)\n print(' accuracy: ', np.mean((linear_response_ > 0.) == Y_))\n print(' deviance: ', 2. * np.mean(log_likelihood_))\n print('||w0-w1||_2 / (1+||w0||_2): ', (np.linalg.norm(w_true_ - w_, ord=2) /\n (1. 
+ np.linalg.norm(w_true_, ord=2))))\n\n # ==>\n # is_converged: True\n # num_iter: 6\n # accuracy: 0.804382\n # deviance: -0.820746600628\n # ||w0-w1||_2 / (1+||w0||_2): 0.00619245105309\n ```\n\n \"\"\"\n graph_deps = [model_matrix, response, model_coefficients_start,\n predicted_linear_response_start, dispersion, offset,\n learning_rate, maximum_iterations]\n with tf.compat.v1.name_scope(name, 'fit', graph_deps):\n [\n model_matrix,\n response,\n model_coefficients_start,\n predicted_linear_response_start,\n offset,\n ] = prepare_args(\n model_matrix,\n response,\n model_coefficients_start,\n predicted_linear_response_start,\n offset)\n if convergence_criteria_fn is None:\n convergence_criteria_fn = (\n convergence_criteria_small_relative_norm_weights_change())\n\n def _body(\n is_converged_previous,\n iter_,\n model_coefficients_previous,\n predicted_linear_response_previous):\n \"\"\"`tf.while_loop` body.\"\"\"\n model_coefficients_next, predicted_linear_response_next = fit_one_step(\n model_matrix,\n response,\n model,\n model_coefficients_previous,\n predicted_linear_response_previous,\n l2_regularizer,\n dispersion,\n offset,\n learning_rate,\n fast_unsafe_numerics)\n is_converged_next = convergence_criteria_fn(\n is_converged_previous=is_converged_previous,\n iter_=iter_,\n model_coefficients_previous=model_coefficients_previous,\n predicted_linear_response_previous=predicted_linear_response_previous,\n model_coefficients_next=model_coefficients_next,\n predicted_linear_response_next=predicted_linear_response_next,\n response=response,\n model=model,\n dispersion=dispersion)\n return [\n is_converged_next,\n iter_ + 1,\n model_coefficients_next,\n predicted_linear_response_next,\n ]\n\n # while not converged:\n # fit_one_step\n [\n is_converged,\n iter_,\n model_coefficients,\n predicted_linear_response,\n ] = tf.while_loop(\n cond=lambda is_converged, *args: tf.logical_not(is_converged),\n body=_body,\n loop_vars=[\n tf.zeros([], np.bool), # is_converged\n 
tf.zeros([], np.int32), # iter_\n model_coefficients_start,\n predicted_linear_response_start,\n ],\n maximum_iterations=maximum_iterations)\n\n return [\n model_coefficients,\n predicted_linear_response,\n is_converged,\n iter_\n ]"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\nfit one step of Fisher scoring.", "response": "def fit_one_step(\n model_matrix,\n response,\n model,\n model_coefficients_start=None,\n predicted_linear_response_start=None,\n l2_regularizer=None,\n dispersion=None,\n offset=None,\n learning_rate=None,\n fast_unsafe_numerics=True,\n name=None):\n \"\"\"Runs one step of Fisher scoring.\n\n Args:\n model_matrix: (Batch of) `float`-like, matrix-shaped `Tensor` where each row\n represents a sample's features.\n response: (Batch of) vector-shaped `Tensor` where each element represents a\n sample's observed response (to the corresponding row of features). Must\n have same `dtype` as `model_matrix`.\n model: `tfp.glm.ExponentialFamily`-like instance used to construct the\n negative log-likelihood loss, gradient, and expected Hessian (i.e., the\n Fisher information matrix).\n model_coefficients_start: Optional (batch of) vector-shaped `Tensor`\n representing the initial model coefficients, one for each column in\n `model_matrix`. 
Must have same `dtype` as `model_matrix`.\n Default value: Zeros.\n predicted_linear_response_start: Optional `Tensor` with `shape`, `dtype`\n matching `response`; represents `offset` shifted initial linear\n predictions based on `model_coefficients_start`.\n Default value: `offset` if `model_coefficients is None`, and\n `tf.linalg.matvec(model_matrix, model_coefficients_start) + offset`\n otherwise.\n l2_regularizer: Optional scalar `Tensor` representing L2 regularization\n penalty, i.e.,\n `loss(w) = sum{-log p(y[i]|x[i],w) : i=1..n} + l2_regularizer ||w||_2^2`.\n Default value: `None` (i.e., no L2 regularization).\n dispersion: Optional (batch of) `Tensor` representing `response` dispersion,\n i.e., as in, `p(y|theta) := exp((y theta - A(theta)) / dispersion)`.\n Must broadcast with rows of `model_matrix`.\n Default value: `None` (i.e., \"no dispersion\").\n offset: Optional `Tensor` representing constant shift applied to\n `predicted_linear_response`. Must broadcast to `response`.\n Default value: `None` (i.e., `tf.zeros_like(response)`).\n learning_rate: Optional (batch of) scalar `Tensor` used to dampen iterative\n progress. 
Typically only needed if optimization diverges, should be no\n larger than `1` and typically very close to `1`.\n Default value: `None` (i.e., `1`).\n fast_unsafe_numerics: Optional Python `bool` indicating if solve should be\n based on Cholesky or QR decomposition.\n Default value: `True` (i.e., \"prefer speed via Cholesky decomposition\").\n name: Python `str` used as name prefix to ops created by this function.\n Default value: `\"fit_one_step\"`.\n\n Returns:\n model_coefficients: (Batch of) vector-shaped `Tensor`; represents the\n next estimate of the model coefficients, one for each column in\n `model_matrix`.\n predicted_linear_response: `response`-shaped `Tensor` representing linear\n predictions based on new `model_coefficients`, i.e.,\n `tf.linalg.matvec(model_matrix, model_coefficients_next) + offset`.\n \"\"\"\n graph_deps = [model_matrix, response, model_coefficients_start,\n predicted_linear_response_start, dispersion, learning_rate]\n with tf.compat.v1.name_scope(name, 'fit_one_step', graph_deps):\n\n [\n model_matrix,\n response,\n model_coefficients_start,\n predicted_linear_response_start,\n offset,\n ] = prepare_args(\n model_matrix,\n response,\n model_coefficients_start,\n predicted_linear_response_start,\n offset)\n\n # Compute: mean, grad(mean, predicted_linear_response_start), and variance.\n mean, variance, grad_mean = model(predicted_linear_response_start)\n\n # If either `grad_mean` or `variance is non-finite or zero, then we'll\n # replace it with a value such that the row is zeroed out. 
Although this\n # procedure may seem circuitous, it is necessary to ensure this algorithm is\n # itself differentiable.\n is_valid = (\n tf.math.is_finite(grad_mean) & tf.not_equal(grad_mean, 0.)\n & tf.math.is_finite(variance) & (variance > 0.))\n\n def mask_if_invalid(x, mask):\n mask = tf.fill(\n tf.shape(input=x), value=np.array(mask, x.dtype.as_numpy_dtype))\n return tf.where(is_valid, x, mask)\n\n # Run one step of iteratively reweighted least-squares.\n # Compute \"`z`\", the adjusted predicted linear response.\n # z = predicted_linear_response_start\n # + learning_rate * (response - mean) / grad_mean\n z = (response - mean) / mask_if_invalid(grad_mean, 1.)\n # TODO(jvdillon): Rather than use learning rate, we should consider using\n # backtracking line search.\n if learning_rate is not None:\n z *= learning_rate[..., tf.newaxis]\n z += predicted_linear_response_start\n if offset is not None:\n z -= offset\n\n # Compute \"`w`\", the per-sample weight.\n if dispersion is not None:\n # For convenience, we'll now scale the variance by the dispersion factor.\n variance *= dispersion\n w = (\n mask_if_invalid(grad_mean, 0.) *\n tf.math.rsqrt(mask_if_invalid(variance, np.inf)))\n\n a = model_matrix * w[..., tf.newaxis]\n b = z * w\n # Solve `min{ || A @ model_coefficients - b ||_2**2 : model_coefficients }`\n # where `@` denotes `matmul`.\n\n if l2_regularizer is None:\n l2_regularizer = np.array(0, a.dtype.as_numpy_dtype)\n else:\n l2_regularizer_ = distribution_util.maybe_get_static_value(\n l2_regularizer, a.dtype.as_numpy_dtype)\n if l2_regularizer_ is not None:\n l2_regularizer = l2_regularizer_\n\n def _embed_l2_regularization():\n \"\"\"Adds synthetic observations to implement L2 regularization.\"\"\"\n # `tf.matrix_solve_ls` does not respect the `l2_regularization` argument\n # when `fast_unsafe_numerics` is `False`. 
This function adds synthetic\n # observations to the data to implement the regularization instead.\n # Adding observations `sqrt(l2_regularizer) * I` is mathematically\n # equivalent to adding the term\n # `-l2_regularizer ||coefficients||_2**2` to the log-likelihood.\n num_model_coefficients = num_cols(model_matrix)\n batch_shape = tf.shape(input=model_matrix)[:-2]\n eye = tf.eye(\n num_model_coefficients, batch_shape=batch_shape, dtype=a.dtype)\n a_ = tf.concat([a, tf.sqrt(l2_regularizer) * eye], axis=-2)\n b_ = distribution_util.pad(\n b, count=num_model_coefficients, axis=-1, back=True)\n # Return l2_regularizer=0 since its now embedded.\n l2_regularizer_ = np.array(0, a.dtype.as_numpy_dtype)\n return a_, b_, l2_regularizer_\n\n a, b, l2_regularizer = prefer_static.cond(\n prefer_static.reduce_all([not(fast_unsafe_numerics),\n l2_regularizer > 0.]),\n _embed_l2_regularization,\n lambda: (a, b, l2_regularizer))\n\n model_coefficients_next = tf.linalg.lstsq(\n a,\n b[..., tf.newaxis],\n fast=fast_unsafe_numerics,\n l2_regularizer=l2_regularizer,\n name='model_coefficients_next')\n model_coefficients_next = model_coefficients_next[..., 0]\n\n # TODO(b/79122261): The approach used in `matrix_solve_ls` could be made\n # faster by avoiding explicitly forming Q and instead keeping the\n # factorization in 'implicit' form with stacked (rescaled) Householder\n # vectors underneath the 'R' and then applying the (accumulated)\n # reflectors in the appropriate order to apply Q'. However, we don't\n # presently do this because we lack core TF functionality. 
For reference,\n # the vanilla QR approach is:\n # q, r = tf.linalg.qr(a)\n # c = tf.matmul(q, b, adjoint_a=True)\n # model_coefficients_next = tf.matrix_triangular_solve(\n # r, c, lower=False, name='model_coefficients_next')\n\n predicted_linear_response_next = calculate_linear_predictor(\n model_matrix,\n model_coefficients_next,\n offset,\n name='predicted_linear_response_next')\n\n return model_coefficients_next, predicted_linear_response_next"} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\nreturns a convergence criteria function which returns True if the fitting procedure has converged.", "response": "def convergence_criteria_small_relative_norm_weights_change(\n tolerance=1e-5,\n norm_order=2):\n \"\"\"Returns Python `callable` which indicates fitting procedure has converged.\n\n Writing old, new `model_coefficients` as `w0`, `w1`, this function\n defines convergence as,\n\n ```python\n relative_euclidean_norm = (tf.norm(w0 - w1, ord=2, axis=-1) /\n (1. + tf.norm(w0, ord=2, axis=-1)))\n reduce_all(relative_euclidean_norm < tolerance)\n ```\n\n where `tf.norm(x, ord=2)` denotes the [Euclidean norm](\n https://en.wikipedia.org/wiki/Norm_(mathematics)#Euclidean_norm) of `x`.\n\n Args:\n tolerance: `float`-like `Tensor` indicating convergence, i.e., when\n max relative Euclidean norm weights difference < tolerance`.\n Default value: `1e-5`.\n norm_order: Order of the norm. Default value: `2` (i.e., \"Euclidean norm\".)\n\n Returns:\n convergence_criteria_fn: Python `callable` which returns `bool` `Tensor`\n indicated fitting procedure has converged. 
(See inner function\n specification for argument signature.)\n Default value: `1e-5`.\n \"\"\"\n def convergence_criteria_fn(\n is_converged_previous, # pylint: disable=unused-argument\n iter_,\n model_coefficients_previous,\n predicted_linear_response_previous, # pylint: disable=unused-argument\n model_coefficients_next,\n predicted_linear_response_next, # pylint: disable=unused-argument\n response, # pylint: disable=unused-argument\n model, # pylint: disable=unused-argument\n dispersion): # pylint: disable=unused-argument\n \"\"\"Returns `bool` `Tensor` indicating if fitting procedure has converged.\n\n Args:\n is_converged_previous: \"old\" convergence results.\n iter_: Iteration number.\n model_coefficients_previous: \"old\" `model_coefficients`.\n predicted_linear_response_previous: \"old\" `predicted_linear_response`.\n model_coefficients_next: \"new\" `model_coefficients`.\n predicted_linear_response_next: \"new: `predicted_linear_response`.\n response: (Batch of) vector-shaped `Tensor` where each element represents\n a sample's observed response (to the corresponding row of features).\n Must have same `dtype` as `model_matrix`.\n model: `tfp.glm.ExponentialFamily`-like instance used to construct the\n negative log-likelihood loss, gradient, and expected Hessian (i.e., the\n Fisher information matrix).\n dispersion: `Tensor` representing `response` dispersion, i.e., as in:\n `p(y|theta) := exp((y theta - A(theta)) / dispersion)`. Must broadcast\n with rows of `model_matrix`.\n Default value: `None` (i.e., \"no dispersion\").\n\n Returns:\n is_converged: `bool` `Tensor`.\n \"\"\"\n relative_euclidean_norm = (\n tf.norm(\n tensor=model_coefficients_previous - model_coefficients_next,\n ord=norm_order,\n axis=-1) /\n (1. 
+\n tf.norm(tensor=model_coefficients_previous, ord=norm_order, axis=-1)))\n return (iter_ > 0) & tf.reduce_all(\n input_tensor=relative_euclidean_norm < tolerance)\n\n return convergence_criteria_fn"} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\ncomputes model_matrix @ model_coefficients + offset.", "response": "def calculate_linear_predictor(model_matrix, model_coefficients, offset=None,\n name=None):\n \"\"\"Computes `model_matrix @ model_coefficients + offset`.\"\"\"\n with tf.compat.v1.name_scope(name, 'calculate_linear_predictor',\n [model_matrix, model_coefficients, offset]):\n predicted_linear_response = tf.linalg.matvec(model_matrix,\n model_coefficients)\n if offset is not None:\n predicted_linear_response += offset\n return predicted_linear_response"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef num_cols(x):\n if tf.compat.dimension_value(x.shape[-1]) is not None:\n return tf.compat.dimension_value(x.shape[-1])\n return tf.shape(input=x)[-1]", "response": "Returns the number of columns in a given Tensor."} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\nwraps original_fn preferring to call static_fn when inputs are static.", "response": "def _prefer_static(original_fn, static_fn):\n \"\"\"Wraps original_fn, preferring to call static_fn when inputs are static.\"\"\"\n original_spec = tf_inspect.getfullargspec(original_fn)\n static_spec = tf_inspect.getfullargspec(static_fn)\n if original_spec != static_spec:\n raise ValueError(\n 'Arg specs do not match: original={}, static={}, fn={}'.format(\n original_spec, static_spec, original_fn))\n @decorator.decorator\n def wrap(wrapped_fn, *args, **kwargs):\n del wrapped_fn\n [args_, kwargs_], all_static = _maybe_get_static_args([args, kwargs])\n if all_static:\n return static_fn(*args_, **kwargs_)\n return original_fn(*args, **kwargs)\n return wrap(original_fn)"} 
{"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef _copy_docstring(original_fn, new_fn):\n original_spec = tf_inspect.getfullargspec(original_fn)\n new_spec = tf_inspect.getfullargspec(new_fn)\n if original_spec != new_spec:\n raise ValueError(\n 'Arg specs do not match: original={}, new={}, fn={}'.format(\n original_spec, new_spec, original_fn))\n @decorator.decorator\n def wrap(wrapped_fn, *args, **kwargs):\n del wrapped_fn\n return new_fn(*args, **kwargs)\n return wrap(original_fn)", "response": "Wraps original_fn with the doc of new_fn."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef rank_from_shape(shape_tensor_fn, tensorshape=None):\n\n if tensorshape is None:\n shape_tensor = (shape_tensor_fn() if callable(shape_tensor_fn)\n else shape_tensor_fn)\n if (hasattr(shape_tensor, 'shape') and\n hasattr(shape_tensor.shape, 'num_elements')):\n ndims_ = tensorshape_util.num_elements(shape_tensor.shape)\n else:\n ndims_ = len(shape_tensor)\n ndims_fn = lambda: tf.size(input=shape_tensor)\n else:\n ndims_ = tensorshape_util.rank(tensorshape)\n ndims_fn = lambda: tf.size(input=shape_tensor_fn() # pylint: disable=g-long-lambda\n if callable(shape_tensor_fn)\n else shape_tensor_fn)\n return ndims_fn() if ndims_ is None else ndims_", "response": "Computes rank given a Tensor's shape."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\nreturn either true_fn or false_fn depending on the predicate.", "response": "def cond(pred, true_fn=None, false_fn=None, name=None):\n \"\"\"Return either `true_fn()` if predicate `pred` is true else `false_fn()`.\n\n If `pred` is a bool or has a constant value, we return either `true_fn()`\n or `false_fn()`, otherwise we use `tf.cond` to dynamically route to both.\n\n Arguments:\n pred: A scalar determining whether to return the result of `true_fn` or\n `false_fn`.\n true_fn: The callable to be performed if pred is 
true.\n false_fn: The callable to be performed if pred is false.\n name: Optional name prefix when using `tf.cond`.\n\n Returns:\n Tensors returned by the call to either `true_fn` or `false_fn`.\n\n Raises:\n TypeError: If `true_fn` or `false_fn` is not callable.\n \"\"\"\n if not callable(true_fn):\n raise TypeError('`true_fn` must be callable.')\n if not callable(false_fn):\n raise TypeError('`false_fn` must be callable.')\n\n pred_value = _get_static_predicate(pred)\n if pred_value is not None:\n if pred_value:\n return true_fn()\n else:\n return false_fn()\n else:\n return tf.cond(pred=pred, true_fn=true_fn, false_fn=false_fn, name=name)"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\nlike tf. case except attempts to statically evaluate predicates.", "response": "def case(pred_fn_pairs, default=None, exclusive=False, name='smart_case'):\n \"\"\"Like tf.case, except attempts to statically evaluate predicates.\n\n If any predicate in `pred_fn_pairs` is a bool or has a constant value, the\n associated callable will be called or omitted depending on its value.\n Otherwise this functions like tf.case.\n\n Args:\n pred_fn_pairs: Dict or list of pairs of a boolean scalar tensor and a\n callable which returns a list of tensors.\n default: Optional callable that returns a list of tensors.\n exclusive: True iff at most one predicate is allowed to evaluate to `True`.\n name: A name for this operation (optional).\n\n Returns:\n The tensors returned by the first pair whose predicate evaluated to True, or\n those returned by `default` if none does.\n\n Raises:\n TypeError: If `pred_fn_pairs` is not a list/dictionary.\n TypeError: If `pred_fn_pairs` is a list but does not contain 2-tuples.\n TypeError: If `fns[i]` is not callable for any i, or `default` is not\n callable.\n \"\"\"\n return control_flow_ops._case_helper( # pylint: disable=protected-access\n cond, pred_fn_pairs, default, exclusive, name, allow_python_preds=True)"} {"SOURCE": 
"codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef log_prob(self, response, predicted_linear_response, name=None):\n\n with self._name_scope(\n name, 'log_prob', [response, predicted_linear_response]):\n dtype = dtype_util.common_dtype([response, predicted_linear_response])\n response = tf.convert_to_tensor(\n value=response, dtype=dtype, name='response')\n predicted_linear_response = tf.convert_to_tensor(\n value=predicted_linear_response, name='predicted_linear_response')\n return self._log_prob(response, predicted_linear_response)", "response": "Computes `D(param=mean(r)).log_prob(response)` for linear response `r`."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef mixture_stddev(mixture_weight_vector, mean_vector, stddev_vector):\n tensorshape_util.assert_has_rank(mixture_weight_vector.shape, 2)\n if not tensorshape_util.is_compatible_with(mean_vector.shape,\n mixture_weight_vector.shape):\n raise ValueError(\"Expecting means to have same shape as mixture weights.\")\n if not tensorshape_util.is_compatible_with(stddev_vector.shape,\n mixture_weight_vector.shape):\n raise ValueError(\"Expecting stddevs to have same shape as mixture weights.\")\n\n # Reshape the distribution parameters for batched vectorized dot products.\n pi_for_dot_prod = tf.expand_dims(mixture_weight_vector, axis=1)\n mu_for_dot_prod = tf.expand_dims(mean_vector, axis=2)\n sigma_for_dot_prod = tf.expand_dims(stddev_vector, axis=2)\n\n # weighted average of component means under mixture distribution.\n mean_wa = tf.matmul(pi_for_dot_prod, mu_for_dot_prod)\n mean_wa = tf.reshape(mean_wa, (-1,))\n # weighted average of component variances under mixture distribution.\n var_wa = tf.matmul(pi_for_dot_prod, tf.square(sigma_for_dot_prod))\n var_wa = tf.reshape(var_wa, (-1,))\n # weighted average of component squared means under mixture distribution.\n sq_mean_wa = 
tf.matmul(pi_for_dot_prod, tf.square(mu_for_dot_prod))\n sq_mean_wa = tf.reshape(sq_mean_wa, (-1,))\n mixture_variance = var_wa + sq_mean_wa - tf.square(mean_wa)\n return tf.sqrt(mixture_variance)", "response": "Computes the standard deviation of a mixture distribution."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\ncreating a LinearOperator representing a lower triangular matrix.", "response": "def make_tril_scale(loc=None,\n scale_tril=None,\n scale_diag=None,\n scale_identity_multiplier=None,\n shape_hint=None,\n validate_args=False,\n assert_positive=False,\n name=None):\n \"\"\"Creates a LinearOperator representing a lower triangular matrix.\n\n Args:\n loc: Floating-point `Tensor`. This is used for inferring shape in the case\n where only `scale_identity_multiplier` is set.\n scale_tril: Floating-point `Tensor` representing the diagonal matrix.\n `scale_diag` has shape [N1, N2, ... k, k], which represents a k x k lower\n triangular matrix. When `None` no `scale_tril` term is added to the\n LinearOperator. The upper triangular elements above the diagonal are\n ignored.\n scale_diag: Floating-point `Tensor` representing the diagonal matrix.\n `scale_diag` has shape [N1, N2, ... k], which represents a k x k diagonal\n matrix. When `None` no diagonal term is added to the LinearOperator.\n scale_identity_multiplier: floating point rank 0 `Tensor` representing a\n scaling done to the identity matrix. When `scale_identity_multiplier =\n scale_diag = scale_tril = None` then `scale += IdentityMatrix`. 
Otherwise\n no scaled-identity-matrix is added to `scale`.\n shape_hint: scalar integer `Tensor` representing a hint at the dimension of\n the identity matrix when only `scale_identity_multiplier` is set.\n validate_args: Python `bool` indicating whether arguments should be checked\n for correctness.\n assert_positive: Python `bool` indicating whether LinearOperator should be\n checked for being positive definite.\n name: Python `str` name given to ops managed by this object.\n\n Returns:\n `LinearOperator` representing a lower triangular matrix.\n\n Raises:\n ValueError: If only `scale_identity_multiplier` is set and `loc` and\n `shape_hint` are both None.\n \"\"\"\n\n def _maybe_attach_assertion(x):\n if not validate_args:\n return x\n if assert_positive:\n return with_dependencies([\n assert_util.assert_positive(\n tf.linalg.diag_part(x), message=\"diagonal part must be positive\"),\n ], x)\n return with_dependencies([\n assert_util.assert_none_equal(\n tf.linalg.diag_part(x),\n tf.zeros([], x.dtype),\n message=\"diagonal part must be non-zero\"),\n ], x)\n\n with tf.name_scope(name or \"make_tril_scale\"):\n\n dtype = dtype_util.common_dtype(\n [loc, scale_tril, scale_diag, scale_identity_multiplier],\n preferred_dtype=tf.float32)\n loc = _convert_to_tensor(loc, name=\"loc\", dtype=dtype)\n scale_tril = _convert_to_tensor(scale_tril, name=\"scale_tril\", dtype=dtype)\n scale_diag = _convert_to_tensor(scale_diag, name=\"scale_diag\", dtype=dtype)\n scale_identity_multiplier = _convert_to_tensor(\n scale_identity_multiplier,\n name=\"scale_identity_multiplier\",\n dtype=dtype)\n\n if scale_tril is not None:\n scale_tril = tf.linalg.band_part(scale_tril, -1, 0) # Zero out TriU.\n tril_diag = tf.linalg.diag_part(scale_tril)\n if scale_diag is not None:\n tril_diag += scale_diag\n if scale_identity_multiplier is not None:\n tril_diag += scale_identity_multiplier[..., tf.newaxis]\n\n scale_tril = tf.linalg.set_diag(scale_tril, tril_diag)\n\n return 
tf.linalg.LinearOperatorLowerTriangular(\n tril=_maybe_attach_assertion(scale_tril),\n is_non_singular=True,\n is_self_adjoint=False,\n is_positive_definite=assert_positive)\n\n return make_diag_scale(\n loc=loc,\n scale_diag=scale_diag,\n scale_identity_multiplier=scale_identity_multiplier,\n shape_hint=shape_hint,\n validate_args=validate_args,\n assert_positive=assert_positive,\n name=name)"} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\ncreates a LinearOperator representing a diagonal matrix.", "response": "def make_diag_scale(loc=None,\n scale_diag=None,\n scale_identity_multiplier=None,\n shape_hint=None,\n validate_args=False,\n assert_positive=False,\n name=None,\n dtype=None):\n \"\"\"Creates a LinearOperator representing a diagonal matrix.\n\n Args:\n loc: Floating-point `Tensor`. This is used for inferring shape in the case\n where only `scale_identity_multiplier` is set.\n scale_diag: Floating-point `Tensor` representing the diagonal matrix.\n `scale_diag` has shape [N1, N2, ... k], which represents a k x k diagonal\n matrix. When `None` no diagonal term is added to the LinearOperator.\n scale_identity_multiplier: floating point rank 0 `Tensor` representing a\n scaling done to the identity matrix. When `scale_identity_multiplier =\n scale_diag = scale_tril = None` then `scale += IdentityMatrix`. Otherwise\n no scaled-identity-matrix is added to `scale`.\n shape_hint: scalar integer `Tensor` representing a hint at the dimension of\n the identity matrix when only `scale_identity_multiplier` is set.\n validate_args: Python `bool` indicating whether arguments should be checked\n for correctness.\n assert_positive: Python `bool` indicating whether LinearOperator should be\n checked for being positive definite.\n name: Python `str` name given to ops managed by this object.\n dtype: TF `DType` to prefer when converting args to `Tensor`s. 
Else, we fall\n back to a compatible dtype across all of `loc`, `scale_diag`, and\n `scale_identity_multiplier`.\n\n Returns:\n `LinearOperator` representing a lower triangular matrix.\n\n Raises:\n ValueError: If only `scale_identity_multiplier` is set and `loc` and\n `shape_hint` are both None.\n \"\"\"\n\n def _maybe_attach_assertion(x):\n if not validate_args:\n return x\n if assert_positive:\n return with_dependencies([\n assert_util.assert_positive(\n x, message=\"diagonal part must be positive\"),\n ], x)\n return with_dependencies([\n assert_util.assert_none_equal(\n x, tf.zeros([], x.dtype), message=\"diagonal part must be non-zero\")\n ], x)\n\n with tf.name_scope(name or \"make_diag_scale\"):\n if dtype is None:\n dtype = dtype_util.common_dtype(\n [loc, scale_diag, scale_identity_multiplier],\n preferred_dtype=tf.float32)\n loc = _convert_to_tensor(loc, name=\"loc\", dtype=dtype)\n scale_diag = _convert_to_tensor(scale_diag, name=\"scale_diag\", dtype=dtype)\n scale_identity_multiplier = _convert_to_tensor(\n scale_identity_multiplier,\n name=\"scale_identity_multiplier\",\n dtype=dtype)\n\n if scale_diag is not None:\n if scale_identity_multiplier is not None:\n scale_diag += scale_identity_multiplier[..., tf.newaxis]\n return tf.linalg.LinearOperatorDiag(\n diag=_maybe_attach_assertion(scale_diag),\n is_non_singular=True,\n is_self_adjoint=True,\n is_positive_definite=assert_positive)\n\n if loc is None and shape_hint is None:\n raise ValueError(\"Cannot infer `event_shape` unless `loc` or \"\n \"`shape_hint` is specified.\")\n\n num_rows = shape_hint\n del shape_hint\n if num_rows is None:\n num_rows = tf.compat.dimension_value(loc.shape[-1])\n if num_rows is None:\n num_rows = tf.shape(input=loc)[-1]\n\n if scale_identity_multiplier is None:\n return tf.linalg.LinearOperatorIdentity(\n num_rows=num_rows,\n dtype=dtype,\n is_self_adjoint=True,\n is_positive_definite=True,\n assert_proper_shapes=validate_args)\n\n return 
tf.linalg.LinearOperatorScaledIdentity(\n num_rows=num_rows,\n multiplier=_maybe_attach_assertion(scale_identity_multiplier),\n is_non_singular=True,\n is_self_adjoint=True,\n is_positive_definite=assert_positive,\n assert_proper_shapes=validate_args)"} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\ninfer batch and event shapes from a location and scale.", "response": "def shapes_from_loc_and_scale(loc, scale, name=\"shapes_from_loc_and_scale\"):\n \"\"\"Infer distribution batch and event shapes from a location and scale.\n\n Location and scale family distributions determine their batch/event shape by\n broadcasting the `loc` and `scale` args. This helper does that broadcast,\n statically if possible.\n\n Batch shape broadcasts as per the normal rules.\n We allow the `loc` event shape to broadcast up to that of `scale`. We do not\n allow `scale`'s event shape to change. Therefore, the last dimension of `loc`\n must either be size `1`, or the same as `scale.range_dimension`.\n\n See `MultivariateNormalLinearOperator` for a usage example.\n\n Args:\n loc: `Tensor` (already converted to tensor) or `None`. 
If `None`, or\n `rank(loc)==0`, both batch and event shape are determined by `scale`.\n scale: A `LinearOperator` instance.\n name: A string name to prepend to created ops.\n\n Returns:\n batch_shape: `TensorShape` (if broadcast is done statically), or `Tensor`.\n event_shape: `TensorShape` (if broadcast is done statically), or `Tensor`.\n\n Raises:\n ValueError: If the last dimension of `loc` is determined statically to be\n different than the range of `scale`.\n \"\"\"\n if loc is not None and tensorshape_util.rank(loc.shape) == 0:\n loc = None # scalar loc is irrelevant to determining batch/event shape.\n with tf.name_scope(name):\n # Get event shape.\n event_size = scale.range_dimension_tensor()\n event_size_ = tf.get_static_value(event_size)\n loc_event_size_ = (None if loc is None\n else tf.compat.dimension_value(loc.shape[-1]))\n\n if event_size_ is not None and loc_event_size_ is not None:\n # Static check that event shapes match.\n if loc_event_size_ != 1 and loc_event_size_ != event_size_:\n raise ValueError(\n \"Event size of 'scale' ({}) could not be broadcast up to that \"\n \"of 'loc' ({}).\".format(event_size_, loc_event_size_))\n elif loc_event_size_ is not None and loc_event_size_ != 1:\n event_size_ = loc_event_size_\n\n if event_size_ is None:\n event_shape = event_size[tf.newaxis]\n else:\n event_shape = tf.convert_to_tensor(\n value=np.reshape(event_size_, [1]),\n dtype=tf.int32,\n name=\"event_shape\")\n\n # Get batch shape.\n batch_shape = scale.batch_shape_tensor()\n if loc is not None:\n loc_batch_shape = tensorshape_util.with_rank_at_least(loc.shape, 1)[:-1]\n if tensorshape_util.rank(\n loc.shape) is None or not tensorshape_util.is_fully_defined(\n loc_batch_shape):\n loc_batch_shape = tf.shape(input=loc)[:-1]\n else:\n loc_batch_shape = tf.convert_to_tensor(\n value=loc_batch_shape, dtype=tf.int32, name=\"loc_batch_shape\")\n # This is defined in the core util module.\n batch_shape = prefer_static_broadcast_shape(batch_shape, 
loc_batch_shape) # pylint: disable=undefined-variable\n batch_shape = tf.convert_to_tensor(\n value=batch_shape, dtype=tf.int32, name=\"batch_shape\")\n\n return batch_shape, event_shape"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\ngetting broadcast shape as a Python list of integers ( preferred ) or a Tensor.", "response": "def get_broadcast_shape(*tensors):\n \"\"\"Get broadcast shape as a Python list of integers (preferred) or `Tensor`.\n\n Args:\n *tensors: One or more `Tensor` objects (already converted!).\n\n Returns:\n broadcast shape: Python list (if shapes determined statically), otherwise\n an `int32` `Tensor`.\n \"\"\"\n # Try static.\n s_shape = tensors[0].shape\n for t in tensors[1:]:\n s_shape = tf.broadcast_static_shape(s_shape, t.shape)\n if tensorshape_util.is_fully_defined(s_shape):\n return tensorshape_util.as_list(s_shape)\n\n # Fallback on dynamic.\n d_shape = tf.shape(input=tensors[0])\n for t in tensors[1:]:\n d_shape = tf.broadcast_dynamic_shape(d_shape, tf.shape(input=t))\n return d_shape"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef is_diagonal_scale(scale):\n if not isinstance(scale, tf.linalg.LinearOperator):\n raise TypeError(\"Expected argument 'scale' to be instance of LinearOperator\"\n \". 
Found: %s\" % scale)\n return (isinstance(scale, tf.linalg.LinearOperatorIdentity) or\n isinstance(scale, tf.linalg.LinearOperatorScaledIdentity) or\n isinstance(scale, tf.linalg.LinearOperatorDiag))", "response": "Returns True if scale is a diagonal scale."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef pad_mixture_dimensions(x, mixture_distribution, categorical_distribution,\n event_ndims):\n \"\"\"Pad dimensions of event tensors for mixture distributions.\n\n See `Mixture._sample_n` and `MixtureSameFamily._sample_n` for usage examples.\n\n Args:\n x: event tensor to pad.\n mixture_distribution: Base distribution of the mixture.\n categorical_distribution: `Categorical` distribution that mixes the base\n distribution.\n event_ndims: Integer specifying the number of event dimensions in the event\n tensor.\n\n Returns:\n A padded version of `x` that can broadcast with `categorical_distribution`.\n \"\"\"\n with tf.name_scope(\"pad_mix_dims\"):\n\n def _get_ndims(d):\n if tensorshape_util.rank(d.batch_shape) is not None:\n return tensorshape_util.rank(d.batch_shape)\n return tf.shape(input=d.batch_shape_tensor())[0]\n\n dist_batch_ndims = _get_ndims(mixture_distribution)\n cat_batch_ndims = _get_ndims(categorical_distribution)\n pad_ndims = tf.where(categorical_distribution.is_scalar_batch(),\n dist_batch_ndims, dist_batch_ndims - cat_batch_ndims)\n s = tf.shape(input=x)\n x = tf.reshape(\n x,\n shape=tf.concat([\n s[:-1],\n tf.ones([pad_ndims], dtype=tf.int32),\n s[-1:],\n tf.ones([event_ndims], dtype=tf.int32),\n ],\n axis=0))\n return x", "response": "Pad dimensions of event tensors for mixture distributions."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef pick_scalar_condition(pred, true_value, false_value, name=None):\n with tf.name_scope(name or \"pick_scalar_condition\"):\n pred = tf.convert_to_tensor(\n value=pred, dtype_hint=tf.bool, 
name=\"pred\")\n true_value = tf.convert_to_tensor(value=true_value, name=\"true_value\")\n false_value = tf.convert_to_tensor(value=false_value, name=\"false_value\")\n pred_ = tf.get_static_value(pred)\n if pred_ is None:\n return tf.where(pred, true_value, false_value)\n return true_value if pred_ else false_value", "response": "Convenience function that chooses one of two values based on the predicate."} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\nmaking possibly negatively indexed axis non - negative.", "response": "def make_non_negative_axis(axis, rank):\n \"\"\"Make (possibly negatively indexed) `axis` argument non-negative.\"\"\"\n axis = tf.convert_to_tensor(value=axis, name=\"axis\")\n rank = tf.convert_to_tensor(value=rank, name=\"rank\")\n axis_ = tf.get_static_value(axis)\n rank_ = tf.get_static_value(rank)\n\n # Static case.\n if axis_ is not None and rank_ is not None:\n is_scalar = axis_.ndim == 0\n if is_scalar:\n axis_ = [axis_]\n positive_axis = []\n for a_ in axis_:\n if a_ < 0:\n positive_axis.append(rank_ + a_)\n else:\n positive_axis.append(a_)\n if is_scalar:\n positive_axis = positive_axis[0]\n return tf.convert_to_tensor(value=positive_axis, dtype=axis.dtype)\n\n # Dynamic case.\n # Unfortunately static values are lost by this tf.where.\n return tf.where(axis < 0, rank + axis, axis)"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef move_dimension(x, source_idx, dest_idx):\n ndims = prefer_static_rank(x)\n dtype = dtype_util.common_dtype([source_idx, dest_idx],\n preferred_dtype=tf.int32)\n source_idx = tf.convert_to_tensor(value=source_idx, dtype=dtype)\n dest_idx = tf.convert_to_tensor(value=dest_idx, dtype=dtype)\n\n # Handle negative indexing.\n source_idx = pick_scalar_condition(source_idx < 0, ndims + source_idx,\n source_idx)\n dest_idx = pick_scalar_condition(dest_idx < 0, ndims + dest_idx, dest_idx)\n\n # Construct the appropriate 
permutation of dimensions, depending\n # whether the source is before or after the destination.\n def move_left_permutation():\n return prefer_static_value(\n tf.concat([\n tf.range(0, dest_idx, dtype=dtype), [source_idx],\n tf.range(dest_idx, source_idx, dtype=dtype),\n tf.range(source_idx + 1, ndims, dtype=dtype)\n ],\n axis=0))\n\n def move_right_permutation():\n return prefer_static_value(\n tf.concat([\n tf.range(0, source_idx, dtype=dtype),\n tf.range(source_idx + 1, dest_idx + 1, dtype=dtype), [source_idx],\n tf.range(dest_idx + 1, ndims, dtype=dtype)\n ],\n axis=0))\n\n def x_permuted():\n return tf.transpose(\n a=x,\n perm=prefer_static.cond(source_idx < dest_idx,\n move_right_permutation,\n move_left_permutation))\n\n # One final conditional to handle the special case where source\n # and destination indices are equal.\n return prefer_static.cond(tf.equal(source_idx, dest_idx),\n lambda: x, x_permuted)", "response": "Move a single tensor dimension within its shape."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef assert_integer_form(x,\n data=None,\n summarize=None,\n message=None,\n int_dtype=None,\n name=\"assert_integer_form\"):\n \"\"\"Assert that x has integer components (or floats equal to integers).\n\n Args:\n x: Floating-point `Tensor`\n data: The tensors to print out if the condition is `False`. Defaults to\n error message and first few entries of `x` and `y`.\n summarize: Print this many entries of each tensor.\n message: A string to prefix to the default message.\n int_dtype: A `tf.dtype` used to cast the float to. 
The default (`None`)\n implies the smallest possible signed int will be used for casting.\n name: A name for this operation (optional).\n\n Returns:\n Op raising `InvalidArgumentError` if `cast(x, int_dtype) != x`.\n \"\"\"\n with tf.name_scope(name):\n x = tf.convert_to_tensor(value=x, name=\"x\")\n if dtype_util.is_integer(x.dtype):\n return tf.no_op()\n message = message or \"{} has non-integer components\".format(x)\n if int_dtype is None:\n try:\n int_dtype = {\n tf.float16: tf.int16,\n tf.float32: tf.int32,\n tf.float64: tf.int64,\n }[dtype_util.base_dtype(x.dtype)]\n except KeyError:\n raise TypeError(\"Unrecognized type {}\".format(dtype_util.name(x.dtype)))\n return assert_util.assert_equal(\n x,\n tf.cast(tf.cast(x, int_dtype), x.dtype),\n data=data,\n summarize=summarize,\n message=message,\n name=name)", "response": "Assert that x has integer components or floats equal to integers."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef embed_check_nonnegative_integer_form(\n x, name=\"embed_check_nonnegative_integer_form\"):\n \"\"\"Assert x is a non-negative tensor, and optionally of integers.\"\"\"\n with tf.name_scope(name):\n x = tf.convert_to_tensor(value=x, name=\"x\")\n assertions = [\n assert_util.assert_non_negative(\n x, message=\"'{}' must be non-negative.\".format(x)),\n ]\n if not dtype_util.is_integer(x.dtype):\n assertions += [\n assert_integer_form(\n x,\n message=\"'{}' cannot contain fractional components.\".format(x)),\n ]\n return with_dependencies(assertions, x)", "response": "Embeds x to be a non - negative tensor and optionally of integers."} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\nreturning True if both tensors have the same dynamic shape.", "response": "def same_dynamic_shape(a, b):\n \"\"\"Returns whether a and b have the same dynamic shape.\n\n Args:\n a: `Tensor`\n b: `Tensor`\n\n Returns:\n `bool` `Tensor` representing if both 
tensors have the same shape.\n \"\"\"\n a = tf.convert_to_tensor(value=a, name=\"a\")\n b = tf.convert_to_tensor(value=b, name=\"b\")\n\n # Here we can't just do tf.equal(a.shape, b.shape), since\n # static shape inference may break the equality comparison between\n # shape(a) and shape(b) in tf.equal.\n def all_shapes_equal():\n return tf.reduce_all(\n input_tensor=tf.equal(\n tf.concat([tf.shape(input=a), tf.shape(input=b)], 0),\n tf.concat([tf.shape(input=b), tf.shape(input=a)], 0)))\n\n # One of the shapes isn't fully defined, so we need to use the dynamic\n # shape.\n return tf.cond(\n pred=tf.equal(tf.rank(a), tf.rank(b)),\n true_fn=all_shapes_equal,\n false_fn=lambda: tf.constant(False))"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef maybe_get_static_value(x, dtype=None):\n if x is None:\n return x\n try:\n # This returns an np.ndarray.\n x_ = tf.get_static_value(x)\n except TypeError:\n x_ = x\n if x_ is None or dtype is None:\n return x_\n return np.array(x_, dtype)", "response": "Helper which tries to return a static value."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\nconvert logit to probabilities or vice - versa and returns both.", "response": "def get_logits_and_probs(logits=None,\n probs=None,\n multidimensional=False,\n validate_args=False,\n name=\"get_logits_and_probs\",\n dtype=None):\n \"\"\"Converts logit to probabilities (or vice-versa), and returns both.\n\n Args:\n logits: Floating-point `Tensor` representing log-odds.\n probs: Floating-point `Tensor` representing probabilities.\n multidimensional: Python `bool`, default `False`. If `True`, represents\n whether the last dimension of `logits` or `probs`, a `[N1, N2, ... k]`\n dimensional tensor, representing the logit or probability of `shape[-1]`\n classes.\n validate_args: Python `bool`, default `False`. 
When `True`, either assert `0\n <= probs <= 1` (if not `multidimensional`) or that the last dimension of\n `probs` sums to one.\n name: A name for this operation (optional).\n dtype: `tf.DType` to prefer when converting args to `Tensor`s.\n\n Returns:\n logits, probs: Tuple of `Tensor`s. If `probs` has an entry that is `0` or\n `1`, then the corresponding entry in the returned logit will be `-Inf` and\n `Inf` respectively.\n\n Raises:\n ValueError: if neither `probs` nor `logits` were passed in, or both were.\n \"\"\"\n if dtype is None:\n dtype = dtype_util.common_dtype([probs, logits], preferred_dtype=tf.float32)\n with tf.name_scope(name):\n if (probs is None) == (logits is None):\n raise ValueError(\"Must pass probs or logits, but not both.\")\n\n if probs is None:\n logits = tf.convert_to_tensor(value=logits, name=\"logits\", dtype=dtype)\n if not dtype_util.is_floating(logits.dtype):\n raise TypeError(\"logits must have floating type.\")\n # We can early return since we constructed probs and therefore know\n # they're valid.\n if multidimensional:\n if validate_args:\n logits = embed_check_categorical_event_shape(logits)\n return logits, tf.nn.softmax(logits, name=\"probs\")\n return logits, tf.sigmoid(logits, name=\"probs\")\n\n probs = tf.convert_to_tensor(value=probs, name=\"probs\", dtype=dtype)\n if not dtype_util.is_floating(probs.dtype):\n raise TypeError(\"probs must have floating type.\")\n\n if validate_args:\n with tf.name_scope(\"validate_probs\"):\n one = tf.constant(1., probs.dtype)\n dependencies = [assert_util.assert_non_negative(probs)]\n if multidimensional:\n probs = embed_check_categorical_event_shape(probs)\n dependencies += [\n assert_util.assert_near(\n tf.reduce_sum(input_tensor=probs, axis=-1),\n one,\n message=\"probs does not sum to 1.\")\n ]\n else:\n dependencies += [\n assert_util.assert_less_equal(\n probs, one, message=\"probs has components greater than 1.\")\n ]\n probs = with_dependencies(dependencies, probs)\n\n with
tf.name_scope(\"logits\"):\n if multidimensional:\n # Here we don't compute the multidimensional case, in a manner\n # consistent with respect to the unidimensional case. We do so\n # following the TF convention. Typically, you might expect to see\n # logits = log(probs) - log(probs[pivot]). A side-effect of\n # being consistent with the TF approach is that the unidimensional case\n # implicitly handles the second dimension but the multidimensional case\n # explicitly keeps the pivot dimension.\n return tf.math.log(probs), probs\n return tf.math.log(probs) - tf.math.log1p(-1. * probs), probs"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef _largest_integer_by_dtype(dt):\n if not _is_known_dtype(dt):\n raise TypeError(\"Unrecognized dtype: {}\".format(dt.name))\n if dt.is_floating:\n return int(2**(np.finfo(dt.as_numpy_dtype).nmant + 1))\n if dt.is_integer:\n return np.iinfo(dt.as_numpy_dtype).max\n if dt.base_dtype == tf.bool:\n return int(1)\n # We actually can't land here but keep the case for completeness.\n raise TypeError(\"Unrecognized dtype: {}\".format(dt.name))", "response": "Helper returning the largest integer exactly representable by dtype."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef embed_check_categorical_event_shape(\n categorical_param, name=\"embed_check_categorical_event_shape\"):\n \"\"\"Embeds checks that categorical distributions don't have too many classes.\n\n A categorical-type distribution is one which, e.g., returns the class label\n rather than a one-hot encoding. E.g., `Categorical(probs)`.\n\n Since distributions output samples in the same dtype as the parameters, we\n must ensure that casting doesn't lose precision. That is, the\n `parameter.dtype` implies a maximum number of classes. 
However, since shape is\n `int32` and categorical variables are presumed to be indexes into a `Tensor`,\n we must also ensure that the number of classes is no larger than the largest\n possible `int32` index, i.e., `2**31-1`.\n\n In other words the number of classes, `K`, must satisfy the following\n condition:\n\n ```python\n K <= min(\n int(2**31 - 1), # Largest float as an index.\n {\n tf.float16: int(2**11), # Largest int as a float16.\n tf.float32: int(2**24),\n tf.float64: int(2**53),\n }.get(dtype_util.base_dtype(categorical_param.dtype), 0))\n ```\n\n Args:\n categorical_param: Floating-point `Tensor` representing parameters of\n distribution over categories. The rightmost shape is presumed to be the\n number of categories.\n name: A name for this operation (optional).\n\n Returns:\n categorical_param: Input `Tensor` with appropriate assertions embedded.\n\n Raises:\n TypeError: if `categorical_param` has an unknown `dtype`.\n ValueError: if we can statically identify `categorical_param` as being too\n large (for being closed under int32/float casting).\n \"\"\"\n with tf.name_scope(name):\n x = tf.convert_to_tensor(value=categorical_param, name=\"categorical_param\")\n # The size must not exceed both of:\n # - The largest possible int32 (since categorical values are presumed to be\n # indexes into a Tensor).\n # - The largest possible integer exactly representable under the given\n # floating-point dtype (since we need to cast to/from).\n #\n # The chosen floating-point thresholds are 2**(1 + mantissa_bits).\n # For more details, see:\n # https://en.wikipedia.org/wiki/Floating-point_arithmetic#Internal_representation\n x_dtype = dtype_util.base_dtype(x.dtype)\n max_event_size = (\n _largest_integer_by_dtype(x_dtype)\n if dtype_util.is_floating(x_dtype) else 0)\n if max_event_size == 0:\n raise TypeError(\"Unable to validate size of unrecognized dtype \"\n \"({}).\".format(dtype_util.name(x_dtype)))\n try:\n x_shape_static = 
tensorshape_util.with_rank_at_least(x.shape, 1)\n except ValueError:\n raise ValueError(\"A categorical-distribution parameter must have \"\n \"at least 1 dimension.\")\n event_size = tf.compat.dimension_value(x_shape_static[-1])\n if event_size is not None:\n if event_size < 2:\n raise ValueError(\"A categorical-distribution parameter must have at \"\n \"least 2 events.\")\n if event_size > max_event_size:\n raise ValueError(\"Number of classes exceeds `dtype` precision, i.e., \"\n \"{} implies shape ({}) cannot exceed {}.\".format(\n dtype_util.name(x_dtype), event_size,\n max_event_size))\n return x\n else:\n event_size = tf.shape(input=x, out_type=tf.int64, name=\"x_shape\")[-1]\n return with_dependencies([\n assert_util.assert_rank_at_least(\n x,\n 1,\n message=(\"A categorical-distribution parameter must have \"\n \"at least 1 dimension.\")),\n assert_util.assert_greater_equal(\n tf.shape(input=x)[-1],\n 2,\n message=(\"A categorical-distribution parameter must have at \"\n \"least 2 events.\")),\n assert_util.assert_less_equal(\n event_size,\n tf.convert_to_tensor(max_event_size, dtype=tf.int64),\n message=\"Number of classes exceeds `dtype` precision, \"\n \"i.e., {} dtype cannot exceed {} shape.\".format(\n dtype_util.name(x_dtype), max_event_size)),\n ], x)", "response": "Embeds checks that categorical - type distributions don t have too many classes."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef embed_check_integer_casting_closed(x,\n target_dtype,\n assert_nonnegative=True,\n assert_positive=False,\n name=\"embed_check_casting_closed\"):\n \"\"\"Ensures integers remain unaffected despite casting to/from int/float types.\n\n Example integer-types: `uint8`, `int32`, `bool`.\n Example floating-types: `float32`, `float64`.\n\n The largest possible integer representable by an IEEE754 floating-point is\n `2**(1 + mantissa_bits)` yet the largest possible integer as an int-type is\n `2**(bits 
- 1) - 1`. This function ensures that a `Tensor` purporting to have\n integer-form values can be cast to some other type without loss of precision.\n\n The smallest representable integer is the negative of the largest\n representable integer, except for types: `uint8`, `uint16`, `bool`. For these\n types, the smallest representable integer is `0`.\n\n Args:\n x: `Tensor` representing integer-form values.\n target_dtype: TF `dtype` under which `x` should have identical values.\n assert_nonnegative: `bool` indicating `x` should contain nonnegative values.\n assert_positive: `bool` indicating `x` should contain positive values.\n name: A name for this operation (optional).\n\n Returns:\n x: Input `Tensor` with appropriate assertions embedded.\n\n Raises:\n TypeError: if `x` is neither integer- nor floating-type.\n TypeError: if `target_dtype` is neither integer- nor floating-type.\n TypeError: if neither `x` nor `target_dtype` are integer-type.\n \"\"\"\n\n with tf.name_scope(name):\n x = tf.convert_to_tensor(value=x, name=\"x\")\n if (not _is_integer_like_by_dtype(x.dtype) and\n not dtype_util.is_floating(x.dtype)):\n raise TypeError(\"{}.dtype must be floating- or \"\n \"integer-type.\".format(dtype_util.name(x.dtype)))\n if (not _is_integer_like_by_dtype(target_dtype) and\n not dtype_util.is_floating(target_dtype)):\n raise TypeError(\"target_dtype ({}) must be floating- or \"\n \"integer-type.\".format(dtype_util.name(target_dtype)))\n if (not _is_integer_like_by_dtype(x.dtype) and\n not _is_integer_like_by_dtype(target_dtype)):\n raise TypeError(\"At least one of {}.dtype ({}) and target_dtype ({}) \"\n \"must be integer-type.\".format(\n x, dtype_util.name(x.dtype),\n dtype_util.name(target_dtype)))\n\n assertions = []\n if assert_positive:\n assertions += [\n assert_util.assert_positive(x, message=\"Elements must be positive.\"),\n ]\n elif assert_nonnegative:\n assertions += [\n assert_util.assert_non_negative(\n x, message=\"Elements must be 
non-negative.\"),\n ]\n\n if dtype_util.is_floating(x.dtype):\n # Being here means _is_integer_like_by_dtype(target_dtype) = True.\n # Since this check implies the magnitude check below, we need only it.\n assertions += [\n assert_integer_form(\n x,\n int_dtype=target_dtype,\n message=\"Elements must be {}-equivalent.\".format(\n dtype_util.name(target_dtype))),\n ]\n else:\n if (_largest_integer_by_dtype(x.dtype) >\n _largest_integer_by_dtype(target_dtype)):\n # Cast may lose integer precision.\n assertions += [\n assert_util.assert_less_equal(\n x,\n _largest_integer_by_dtype(target_dtype),\n message=(\"Elements cannot exceed {}.\".format(\n _largest_integer_by_dtype(target_dtype)))),\n ]\n if (not assert_nonnegative and (_smallest_integer_by_dtype(\n x.dtype) < _smallest_integer_by_dtype(target_dtype))):\n assertions += [\n assert_util.assert_greater_equal(\n x,\n _smallest_integer_by_dtype(target_dtype),\n message=(\"Elements cannot be smaller than {}.\".format(\n _smallest_integer_by_dtype(target_dtype)))),\n ]\n\n if not assertions:\n return x\n return with_dependencies(assertions, x)", "response": "Embeds an integer - casting - closed tensor to a single integer - form."} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\ncompute the log combinations of n and counts.", "response": "def log_combinations(n, counts, name=\"log_combinations\"):\n \"\"\"Multinomial coefficient.\n\n Given `n` and `counts`, where `counts` has last dimension `k`, we compute\n the multinomial coefficient as:\n\n ```n! / sum_i n_i!```\n\n where `i` runs over all `k` classes.\n\n Args:\n n: Floating-point `Tensor` broadcastable with `counts`. This represents `n`\n outcomes.\n counts: Floating-point `Tensor` broadcastable with `n`. 
This represents\n counts in `k` classes, where `k` is the last dimension of the tensor.\n name: A name for this operation (optional).\n\n Returns:\n `Tensor` representing the multinomial coefficient between `n` and `counts`.\n \"\"\"\n # First a bit about the number of ways counts could have come in:\n # E.g. if counts = [1, 2], then this is 3 choose 2.\n # In general, this is (sum counts)! / sum(counts!)\n # The sum should be along the last dimension of counts. This is the\n # \"distribution\" dimension. Here n a priori represents the sum of counts.\n with tf.name_scope(name):\n n = tf.convert_to_tensor(value=n, name=\"n\")\n counts = tf.convert_to_tensor(value=counts, name=\"counts\")\n total_permutations = tf.math.lgamma(n + 1)\n counts_factorial = tf.math.lgamma(counts + 1)\n redundant_permutations = tf.reduce_sum(\n input_tensor=counts_factorial, axis=[-1])\n return total_permutations - redundant_permutations"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef matrix_diag_transform(matrix, transform=None, name=None):\n with tf.name_scope(name or \"matrix_diag_transform\"):\n matrix = tf.convert_to_tensor(value=matrix, name=\"matrix\")\n if transform is None:\n return matrix\n # Replace the diag with transformed diag.\n diag = tf.linalg.diag_part(matrix)\n transformed_diag = transform(diag)\n transformed_mat = tf.linalg.set_diag(matrix, transformed_diag)\n\n return transformed_mat", "response": "Transform diagonal of matrix."} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef rotate_transpose(x, shift, name=\"rotate_transpose\"):\n with tf.name_scope(name):\n x = tf.convert_to_tensor(value=x, name=\"x\")\n shift = tf.convert_to_tensor(value=shift, name=\"shift\")\n # We do not assign back to preserve constant-ness.\n assert_util.assert_integer(shift)\n shift_value_static = tf.get_static_value(shift)\n ndims = tensorshape_util.rank(x.shape)\n if 
ndims is not None and shift_value_static is not None:\n if ndims < 2:\n return x\n shift_value_static = np.sign(shift_value_static) * (\n abs(shift_value_static) % ndims)\n if shift_value_static == 0:\n return x\n perm = np.roll(np.arange(ndims), shift_value_static)\n return tf.transpose(a=x, perm=perm)\n else:\n # Consider if we always had a positive shift, and some specified\n # direction.\n # When shifting left we want the new array:\n # last(x, n-shift) + first(x, shift)\n # and if shifting right then we want:\n # last(x, shift) + first(x, n-shift)\n # Observe that last(a) == slice(a, n) and first(a) == slice(0, a).\n # Also, we can encode direction and shift as one: direction * shift.\n # Combining these facts, we have:\n # a = cond(shift<0, -shift, n-shift)\n # last(x, n-a) + first(x, a) == x[a:n] + x[0:a]\n # Finally, we transform shift by modulo length so it can be specified\n # independently from the array upon which it operates (like python).\n ndims = tf.rank(x)\n shift = tf.where(\n tf.less(shift, 0), -shift % ndims,\n ndims - shift % ndims)\n first = tf.range(0, shift)\n last = tf.range(shift, ndims)\n perm = tf.concat([last, first], 0)\n return tf.transpose(a=x, perm=perm)", "response": "Rotates x by shift."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef pick_vector(cond, true_vector, false_vector, name=\"pick_vector\"):\n with tf.name_scope(name):\n cond = tf.convert_to_tensor(\n value=cond, dtype_hint=tf.bool, name=\"cond\")\n if cond.dtype != tf.bool:\n raise TypeError(\n \"{}.dtype={} which is not {}\".format(cond, cond.dtype, tf.bool))\n\n true_vector = tf.convert_to_tensor(value=true_vector, name=\"true_vector\")\n false_vector = tf.convert_to_tensor(value=false_vector, name=\"false_vector\")\n if true_vector.dtype != false_vector.dtype:\n raise TypeError(\n \"{}.dtype={} does not match {}.dtype={}\".format(\n true_vector, true_vector.dtype, false_vector, false_vector.dtype))\n\n cond_value_static 
= tf.get_static_value(cond)\n if cond_value_static is not None:\n return true_vector if cond_value_static else false_vector\n n = tf.shape(input=true_vector)[0]\n return tf.slice(\n tf.concat([true_vector, false_vector], 0), [tf.where(cond, 0, n)],\n [tf.where(cond, n, -1)])", "response": "Picks possibly different length row tensors based on condition."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef gen_new_seed(seed, salt):\n if seed is None:\n return None\n string = (str(seed) + salt).encode(\"utf-8\")\n return int(hashlib.md5(string).hexdigest()[:8], 16) & 0x7FFFFFFF", "response": "Generate a new seed from the given seed and salt."} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef fill_triangular(x, upper=False, name=None):\n\n with tf.name_scope(name or \"fill_triangular\"):\n x = tf.convert_to_tensor(value=x, name=\"x\")\n m = tf.compat.dimension_value(\n tensorshape_util.with_rank_at_least(x.shape, 1)[-1])\n if m is not None:\n # Formula derived by solving for n: m = n(n+1)/2.\n m = np.int32(m)\n n = np.sqrt(0.25 + 2. * m) - 0.5\n if n != np.floor(n):\n raise ValueError(\"Input right-most shape ({}) does not \"\n \"correspond to a triangular matrix.\".format(m))\n n = np.int32(n)\n static_final_shape = x.shape[:-1].concatenate([n, n])\n else:\n m = tf.shape(input=x)[-1]\n # For derivation, see above. Casting automatically lops off the 0.5, so we\n # omit it. 
We don't validate n is an integer because this has\n # graph-execution cost; an error will be thrown from the reshape, below.\n n = tf.cast(\n tf.sqrt(0.25 + tf.cast(2 * m, dtype=tf.float32)), dtype=tf.int32)\n static_final_shape = tensorshape_util.with_rank_at_least(\n x.shape, 1)[:-1].concatenate([None, None])\n\n # Try it out in numpy:\n # n = 3\n # x = np.arange(n * (n + 1) / 2)\n # m = x.shape[0]\n # n = np.int32(np.sqrt(.25 + 2 * m) - .5)\n # x_tail = x[(m - (n**2 - m)):]\n # np.concatenate([x_tail, x[::-1]], 0).reshape(n, n) # lower\n # # ==> array([[3, 4, 5],\n # [5, 4, 3],\n # [2, 1, 0]])\n # np.concatenate([x, x_tail[::-1]], 0).reshape(n, n) # upper\n # # ==> array([[0, 1, 2],\n # [3, 4, 5],\n # [5, 4, 3]])\n #\n # Note that we can't simply do `x[..., -(n**2 - m):]` because this doesn't\n # correctly handle `m == n == 1`. Hence, we do nonnegative indexing.\n # Furthermore observe that:\n # m - (n**2 - m)\n # = n**2 / 2 + n / 2 - (n**2 - n**2 / 2 + n / 2)\n # = 2 (n**2 / 2 + n / 2) - n**2\n # = n**2 + n - n**2\n # = n\n ndims = prefer_static_rank(x)\n if upper:\n x_list = [x, tf.reverse(x[..., n:], axis=[ndims - 1])]\n else:\n x_list = [x[..., n:], tf.reverse(x, axis=[ndims - 1])]\n new_shape = (\n tensorshape_util.as_list(static_final_shape)\n if tensorshape_util.is_fully_defined(static_final_shape) else tf.concat(\n [tf.shape(input=x)[:-1], [n, n]], axis=0))\n x = tf.reshape(tf.concat(x_list, axis=-1), new_shape)\n x = tf.linalg.band_part(\n x, num_lower=(0 if upper else -1), num_upper=(-1 if upper else 0))\n tensorshape_util.set_shape(x, static_final_shape)\n return x", "response": "r Creates a triangular matrix from a vector of inputs."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\ncreate a vector from a batch of triangular elements.", "response": "def fill_triangular_inverse(x, upper=False, name=None):\n \"\"\"Creates a vector from a (batch of) triangular matrix.\n\n The vector is created from the lower-triangular or 
upper-triangular portion\n depending on the value of the parameter `upper`.\n\n If `x.shape` is `[b1, b2, ..., bB, n, n]` then the output shape is\n `[b1, b2, ..., bB, d]` where `d = n (n + 1) / 2`.\n\n Example:\n\n ```python\n fill_triangular_inverse(\n [[4, 0, 0],\n [6, 5, 0],\n [3, 2, 1]])\n\n # ==> [1, 2, 3, 4, 5, 6]\n\n fill_triangular_inverse(\n [[1, 2, 3],\n [0, 5, 6],\n [0, 0, 4]], upper=True)\n\n # ==> [1, 2, 3, 4, 5, 6]\n ```\n\n Args:\n x: `Tensor` representing lower (or upper) triangular elements.\n upper: Python `bool` representing whether output matrix should be upper\n triangular (`True`) or lower triangular (`False`, default).\n name: Python `str`. The name to give this op.\n\n Returns:\n flat_tril: (Batch of) vector-shaped `Tensor` representing vectorized lower\n (or upper) triangular elements from `x`.\n \"\"\"\n\n with tf.name_scope(name or \"fill_triangular_inverse\"):\n x = tf.convert_to_tensor(value=x, name=\"x\")\n n = tf.compat.dimension_value(\n tensorshape_util.with_rank_at_least(x.shape, 2)[-1])\n if n is not None:\n n = np.int32(n)\n m = np.int32((n * (n + 1)) // 2)\n static_final_shape = x.shape[:-2].concatenate([m])\n else:\n n = tf.shape(input=x)[-1]\n m = (n * (n + 1)) // 2\n static_final_shape = tensorshape_util.with_rank_at_least(\n x.shape, 2)[:-2].concatenate([None])\n ndims = prefer_static_rank(x)\n if upper:\n initial_elements = x[..., 0, :]\n triangular_portion = x[..., 1:, :]\n else:\n initial_elements = tf.reverse(x[..., -1, :], axis=[ndims - 2])\n triangular_portion = x[..., :-1, :]\n rotated_triangular_portion = tf.reverse(\n tf.reverse(triangular_portion, axis=[ndims - 1]), axis=[ndims - 2])\n consolidated_matrix = triangular_portion + rotated_triangular_portion\n end_sequence = tf.reshape(\n consolidated_matrix,\n tf.concat([tf.shape(input=x)[:-2], [n * (n - 1)]], axis=0))\n y = tf.concat([initial_elements, end_sequence[..., :m - n]], axis=-1)\n tensorshape_util.set_shape(y, static_final_shape)\n return y"} {"SOURCE": 
"codesearchnet", "instruction": "Implement a function in Python 3 to\ncreate a matrix with values set above, below, and on the diagonal. Example: ```python tridiag(below=[1., 2., 3.], diag=[4., 5., 6., 7.], above=[8., 9., 10.]) # ==> array([[ 4., 8., 0., 0.], # [ 1., 5., 9., 0.], # [ 0., 2., 6., 10.], # [ 0., 0., 3., 7.]], dtype=float32) ``` Warning: This Op is intended for convenience, not efficiency. Args: below: `Tensor` of shape `[B1, ..., Bb, d-1]` corresponding to the below diagonal part. `None` is logically equivalent to `below = 0`. diag: `Tensor` of shape `[B1, ..., Bb, d]` corresponding to the diagonal part. `None` is logically equivalent to `diag = 0`. above: `Tensor` of shape `[B1, ..., Bb, d-1]` corresponding to the above diagonal part. `None` is logically equivalent to `above = 0`. name: Python `str`. The name to give this op. Returns: tridiag: `Tensor` with values set above, below and on the diagonal. Raises: ValueError: if all inputs are `None`.", "response": "def tridiag(below=None, diag=None, above=None, name=None):\n \"\"\"Creates a matrix with values set above, below, and on the diagonal.\n\n Example:\n\n ```python\n tridiag(below=[1., 2., 3.],\n diag=[4., 5., 6., 7.],\n above=[8., 9., 10.])\n # ==> array([[ 4., 8., 0., 0.],\n # [ 1., 5., 9., 0.],\n # [ 0., 2., 6., 10.],\n # [ 0., 0., 3., 7.]], dtype=float32)\n ```\n\n Warning: This Op is intended for convenience, not efficiency.\n\n Args:\n below: `Tensor` of shape `[B1, ..., Bb, d-1]` corresponding to the below\n diagonal part. `None` is logically equivalent to `below = 0`.\n diag: `Tensor` of shape `[B1, ..., Bb, d]` corresponding to the diagonal\n part. `None` is logically equivalent to `diag = 0`.\n above: `Tensor` of shape `[B1, ..., Bb, d-1]` corresponding to the above\n diagonal part. `None` is logically equivalent to `above = 0`.\n name: Python `str`. 
The name to give this op.\n\n Returns:\n tridiag: `Tensor` with values set above, below and on the diagonal.\n\n Raises:\n ValueError: if all inputs are `None`.\n \"\"\"\n\n def _pad(x):\n \"\"\"Prepends and appends a zero to every vector in a batch of vectors.\"\"\"\n shape = tf.concat([tf.shape(input=x)[:-1], [1]], axis=0)\n z = tf.zeros(shape, dtype=x.dtype)\n return tf.concat([z, x, z], axis=-1)\n\n def _add(*x):\n \"\"\"Adds list of Tensors, ignoring `None`.\"\"\"\n s = None\n for y in x:\n if y is None:\n continue\n elif s is None:\n s = y\n else:\n s += y\n if s is None:\n raise ValueError(\"Must specify at least one of `below`, `diag`, `above`.\")\n return s\n\n with tf.name_scope(name or \"tridiag\"):\n if below is not None:\n below = tf.convert_to_tensor(value=below, name=\"below\")\n below = tf.linalg.diag(_pad(below))[..., :-1, 1:]\n if diag is not None:\n diag = tf.convert_to_tensor(value=diag, name=\"diag\")\n diag = tf.linalg.diag(diag)\n if above is not None:\n above = tf.convert_to_tensor(value=above, name=\"above\")\n above = tf.linalg.diag(_pad(above))[..., 1:, :-1]\n # TODO(jvdillon): Consider using scatter_nd instead of creating three full\n # matrices.\n return _add(below, diag, above)"} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\nreducing the logx along the given axis and returns the corresponding logx.", "response": "def reduce_weighted_logsumexp(logx,\n w=None,\n axis=None,\n keep_dims=False,\n return_sign=False,\n name=None):\n \"\"\"Computes `log(abs(sum(weight * exp(elements across tensor dimensions))))`.\n\n If all weights `w` are known to be positive, it is more efficient to directly\n use `reduce_logsumexp`, i.e., `tf.reduce_logsumexp(logx + tf.log(w))` is more\n efficient than `du.reduce_weighted_logsumexp(logx, w)`.\n\n Reduces `input_tensor` along the dimensions given in `axis`.\n Unless `keep_dims` is true, the rank of the tensor is reduced by 1 for each\n entry in `axis`. 
If `keep_dims` is true, the reduced dimensions\n are retained with length 1.\n\n If `axis` has no entries, all dimensions are reduced, and a\n tensor with a single element is returned.\n\n This function is more numerically stable than log(sum(w * exp(input))). It\n avoids overflows caused by taking the exp of large inputs and underflows\n caused by taking the log of small inputs.\n\n For example:\n\n ```python\n x = tf.constant([[0., 0, 0],\n [0, 0, 0]])\n\n w = tf.constant([[-1., 1, 1],\n [1, 1, 1]])\n\n du.reduce_weighted_logsumexp(x, w)\n # ==> log(-1*1 + 1*1 + 1*1 + 1*1 + 1*1 + 1*1) = log(4)\n\n du.reduce_weighted_logsumexp(x, w, axis=0)\n # ==> [log(-1+1), log(1+1), log(1+1)]\n\n du.reduce_weighted_logsumexp(x, w, axis=1)\n # ==> [log(-1+1+1), log(1+1+1)]\n\n du.reduce_weighted_logsumexp(x, w, axis=1, keep_dims=True)\n # ==> [[log(-1+1+1)], [log(1+1+1)]]\n\n du.reduce_weighted_logsumexp(x, w, axis=[0, 1])\n # ==> log(-1+5)\n ```\n\n Args:\n logx: The tensor to reduce. Should have numeric type.\n w: The weight tensor. Should have numeric type identical to `logx`.\n axis: The dimensions to reduce. If `None` (the default), reduces all\n dimensions. 
Must be in the range `[-rank(input_tensor),\n rank(input_tensor))`.\n keep_dims: If true, retains reduced dimensions with length 1.\n return_sign: If `True`, returns the sign of the result.\n name: A name for the operation (optional).\n\n Returns:\n lswe: The `log(abs(sum(weight * exp(x))))` reduced tensor.\n sign: (Optional) The sign of `sum(weight * exp(x))`.\n \"\"\"\n with tf.name_scope(name or \"reduce_weighted_logsumexp\"):\n logx = tf.convert_to_tensor(value=logx, name=\"logx\")\n if w is None:\n lswe = tf.reduce_logsumexp(\n input_tensor=logx, axis=axis, keepdims=keep_dims)\n if return_sign:\n sgn = tf.ones_like(lswe)\n return lswe, sgn\n return lswe\n w = tf.convert_to_tensor(value=w, dtype=logx.dtype, name=\"w\")\n log_absw_x = logx + tf.math.log(tf.abs(w))\n max_log_absw_x = tf.reduce_max(\n input_tensor=log_absw_x, axis=axis, keepdims=True)\n # If the largest element is `-inf` or `inf` then we don't bother subtracting\n # off the max. We do this because otherwise we'd get `inf - inf = NaN`. 
That\n # this is ok follows from the fact that we're actually free to subtract any\n # value we like, so long as we add it back after taking the `log(sum(...))`.\n max_log_absw_x = tf.where(\n tf.math.is_inf(max_log_absw_x), tf.zeros_like(max_log_absw_x),\n max_log_absw_x)\n wx_over_max_absw_x = (tf.sign(w) * tf.exp(log_absw_x - max_log_absw_x))\n sum_wx_over_max_absw_x = tf.reduce_sum(\n input_tensor=wx_over_max_absw_x, axis=axis, keepdims=keep_dims)\n if not keep_dims:\n max_log_absw_x = tf.squeeze(max_log_absw_x, axis)\n sgn = tf.sign(sum_wx_over_max_absw_x)\n lswe = max_log_absw_x + tf.math.log(sgn * sum_wx_over_max_absw_x)\n if return_sign:\n return lswe, sgn\n return lswe"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef softplus_inverse(x, name=None):\n with tf.name_scope(name or \"softplus_inverse\"):\n x = tf.convert_to_tensor(value=x, name=\"x\")\n # We begin by deriving a more numerically stable softplus_inverse:\n # x = softplus(y) = Log[1 + exp{y}], (which means x > 0).\n # ==> exp{x} = 1 + exp{y} (1)\n # ==> y = Log[exp{x} - 1] (2)\n # = Log[(exp{x} - 1) / exp{x}] + Log[exp{x}]\n # = Log[(1 - exp{-x}) / 1] + Log[exp{x}]\n # = Log[1 - exp{-x}] + x (3)\n # (2) is the \"obvious\" inverse, but (3) is more stable than (2) for large x.\n # For small x (e.g. x = 1e-10), (3) will become -inf since 1 - exp{-x} will\n # be zero. To fix this, we use 1 - exp{-x} approx x for small x > 0.\n #\n # In addition to the numerically stable derivation above, we clamp\n # small/large values to be congruent with the logic in:\n # tensorflow/core/kernels/softplus_op.h\n #\n # Finally, we set the input to one whenever the input is too large or too\n # small. This ensures that no unchosen codepath is +/- inf. This is\n # necessary to ensure the gradient doesn't get NaNs. 
Recall that the\n # gradient of `where` behaves like `pred*pred_true + (1-pred)*pred_false`\n # thus an `inf` in an unselected path results in `0*inf=nan`. We are careful\n # to overwrite `x` with ones only when we will never actually use this\n # value. Note that we use ones and not zeros since `log(expm1(0.)) = -inf`.\n threshold = np.log(np.finfo(dtype_util.as_numpy_dtype(x.dtype)).eps) + 2.\n is_too_small = tf.less(x, np.exp(threshold))\n is_too_large = tf.greater(x, -threshold)\n too_small_value = tf.math.log(x)\n too_large_value = x\n # This `where` will ultimately be a NOP because we won't select this\n # codepath whenever we used the surrogate `ones_like`.\n x = tf.where(tf.logical_or(is_too_small, is_too_large), tf.ones_like(x), x)\n y = x + tf.math.log(-tf.math.expm1(-x)) # == log(expm1(x))\n return tf.where(is_too_small, too_small_value,\n tf.where(is_too_large, too_large_value, y))", "response": "Computes the inverse softplus."} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\nreturns the size of a specific dimension.", "response": "def dimension_size(x, axis):\n \"\"\"Returns the size of a specific dimension.\"\"\"\n # Since tf.gather isn't \"constant-in, constant-out\", we must first check the\n # static shape or fallback to dynamic shape.\n s = tf.compat.dimension_value(\n tensorshape_util.with_rank_at_least(x.shape, np.abs(axis))[axis])\n if s is not None:\n return s\n return tf.shape(input=x)[axis]"} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\nvalidate quadrature grid and probs or computes them as necessary.", "response": "def process_quadrature_grid_and_probs(quadrature_grid_and_probs,\n dtype,\n validate_args,\n name=None):\n \"\"\"Validates quadrature grid, probs or computes them as necessary.\n\n Args:\n quadrature_grid_and_probs: Python pair of `float`-like `Tensor`s\n representing the sample points and the corresponding (possibly\n normalized) weight. 
When `None`, defaults to:\n `np.polynomial.hermite.hermgauss(deg=8)`.\n dtype: The expected `dtype` of `grid` and `probs`.\n validate_args: Python `bool`, default `False`. When `True` distribution\n parameters are checked for validity despite possibly degrading runtime\n performance. When `False` invalid inputs may silently render incorrect\n outputs.\n name: Python `str` name prefixed to Ops created by this class.\n\n Returns:\n quadrature_grid_and_probs: Python pair of `float`-like `Tensor`s\n representing the sample points and the corresponding (possibly\n normalized) weight.\n\n Raises:\n ValueError: if `quadrature_grid_and_probs is not None` and\n `len(quadrature_grid_and_probs[0]) != len(quadrature_grid_and_probs[1])`\n \"\"\"\n with tf.name_scope(name or \"process_quadrature_grid_and_probs\"):\n if quadrature_grid_and_probs is None:\n grid, probs = np.polynomial.hermite.hermgauss(deg=8)\n grid = grid.astype(dtype_util.as_numpy_dtype(dtype))\n probs = probs.astype(dtype_util.as_numpy_dtype(dtype))\n probs /= np.linalg.norm(probs, ord=1, keepdims=True)\n grid = tf.convert_to_tensor(value=grid, name=\"grid\", dtype=dtype)\n probs = tf.convert_to_tensor(value=probs, name=\"probs\", dtype=dtype)\n return grid, probs\n\n grid, probs = tuple(quadrature_grid_and_probs)\n grid = tf.convert_to_tensor(value=grid, name=\"grid\", dtype=dtype)\n probs = tf.convert_to_tensor(\n value=probs, name=\"unnormalized_probs\", dtype=dtype)\n probs /= tf.norm(tensor=probs, ord=1, axis=-1, keepdims=True, name=\"probs\")\n\n def _static_event_size(x):\n \"\"\"Returns the static size of a specific dimension or `None`.\"\"\"\n return tf.compat.dimension_value(\n tensorshape_util.with_rank_at_least(x.shape, 1)[-1])\n\n m, n = _static_event_size(probs), _static_event_size(grid)\n if m is not None and n is not None:\n if m != n:\n raise ValueError(\"`quadrature_grid_and_probs` must be a `tuple` of \"\n \"same-length zero-th-dimension `Tensor`s \"\n \"(saw lengths {}, {})\".format(m, n))\n 
elif validate_args:\n assertions = [\n assert_util.assert_equal(\n dimension_size(probs, axis=-1),\n dimension_size(grid, axis=-1),\n message=(\"`quadrature_grid_and_probs` must be a `tuple` of \"\n \"same-length zero-th-dimension `Tensor`s\")),\n ]\n with tf.control_dependencies(assertions):\n grid = tf.identity(grid)\n probs = tf.identity(probs)\n return grid, probs"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef pad(x, axis, front=False, back=False, value=0, count=1, name=None):\n with tf.name_scope(name or \"pad\"):\n x = tf.convert_to_tensor(value=x, name=\"x\")\n value = tf.convert_to_tensor(value=value, dtype=x.dtype, name=\"value\")\n count = tf.convert_to_tensor(value=count, name=\"count\")\n if not dtype_util.is_integer(count.dtype):\n raise TypeError(\"`count.dtype` (`{}`) must be `int`-like.\".format(\n dtype_util.name(count.dtype)))\n if not front and not back:\n raise ValueError(\"At least one of `front`, `back` must be `True`.\")\n ndims = (\n tensorshape_util.rank(x.shape)\n if tensorshape_util.rank(x.shape) is not None else tf.rank(\n x, name=\"ndims\"))\n axis = tf.convert_to_tensor(value=axis, name=\"axis\")\n axis_ = tf.get_static_value(axis)\n if axis_ is not None:\n axis = axis_\n if axis < 0:\n axis = ndims + axis\n count_ = tf.get_static_value(count)\n if axis_ >= 0 or tensorshape_util.rank(x.shape) is not None:\n head = x.shape[:axis]\n mid_dim_value = tf.compat.dimension_value(x.shape[axis])\n if count_ is None or mid_dim_value is None:\n middle = tf.TensorShape(None)\n else:\n middle = tf.TensorShape(mid_dim_value + count_ * (front + back))\n tail = x.shape[axis + 1:]\n final_shape = head.concatenate(middle.concatenate(tail))\n else:\n final_shape = None\n else:\n axis = tf.where(axis < 0, ndims + axis, axis)\n final_shape = None\n x = tf.pad(\n tensor=x,\n paddings=tf.one_hot(\n indices=tf.stack([axis if front else -1, axis if back else -1]),\n depth=ndims,\n axis=0,\n on_value=count,\n 
dtype=tf.int32),\n constant_values=value)\n if final_shape is not None:\n tensorshape_util.set_shape(x, final_shape)\n return x", "response": "Pads value to the front and back of a Tensor dim count times."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef parent_frame_arguments():\n # All arguments and the names used for *varargs, and **kwargs\n arg_names, variable_arg_name, keyword_arg_name, local_vars = (\n tf_inspect._inspect.getargvalues( # pylint: disable=protected-access\n # Get the first frame of the caller of this method.\n tf_inspect._inspect.stack()[1][0])) # pylint: disable=protected-access\n\n # Remove the *varargs, and flatten the **kwargs. Both are\n # nested lists.\n local_vars.pop(variable_arg_name, {})\n keyword_args = local_vars.pop(keyword_arg_name, {})\n\n final_args = {}\n # Copy over arguments and their values. In general, local_vars\n # may contain more than just the arguments, since this method\n # can be called anywhere in a function.\n for arg_name in arg_names:\n final_args[arg_name] = local_vars.pop(arg_name)\n final_args.update(keyword_args)\n\n return final_args", "response": "Returns the arguments of the caller s function\n."} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\ntransforms a 0 - D tensor to be 1 - D.", "response": "def expand_to_vector(x, tensor_name=None, op_name=None, validate_args=False):\n \"\"\"Transform a 0-D or 1-D `Tensor` to be 1-D.\n\n For user convenience, many parts of the TensorFlow Probability API accept\n inputs of rank 0 or 1 -- i.e., allowing an `event_shape` of `[5]` to be passed\n to the API as either `5` or `[5]`. This function can be used to transform\n such an argument to always be 1-D.\n\n NOTE: Python or NumPy values will be converted to `Tensor`s with standard type\n inference/conversion. In particular, an empty list or tuple will become an\n empty `Tensor` with dtype `float32`. 
Callers should convert values to\n `Tensor`s before calling this function if different behavior is desired\n (e.g. converting empty lists / other values to `Tensor`s with dtype `int32`).\n\n Args:\n x: A 0-D or 1-D `Tensor`.\n tensor_name: Python `str` name for `Tensor`s created by this function.\n op_name: Python `str` name for `Op`s created by this function.\n validate_args: Python `bool, default `False`. When `True`, arguments may be\n checked for validity at execution time, possibly degrading runtime\n performance. When `False`, invalid inputs may silently render incorrect\n outputs.\n Returns:\n vector: a 1-D `Tensor`.\n \"\"\"\n with tf.name_scope(op_name or \"expand_to_vector\"):\n x = tf.convert_to_tensor(value=x, name=\"x\")\n ndims = tensorshape_util.rank(x.shape)\n\n if ndims is None:\n # Maybe expand ndims from 0 to 1.\n if validate_args:\n x = with_dependencies([\n assert_util.assert_rank_at_most(\n x, 1, message=\"Input is neither scalar nor vector.\")\n ], x)\n ndims = tf.rank(x)\n expanded_shape = pick_vector(\n tf.equal(ndims, 0), np.array([1], dtype=np.int32), tf.shape(input=x))\n return tf.reshape(x, expanded_shape)\n\n elif ndims == 0:\n # Definitely expand ndims from 0 to 1.\n x_const = tf.get_static_value(x)\n if x_const is not None:\n return tf.convert_to_tensor(\n value=dtype_util.as_numpy_dtype(x.dtype)([x_const]),\n name=tensor_name)\n\n else:\n return tf.reshape(x, [1])\n\n elif ndims != 1:\n raise ValueError(\"Input is neither scalar nor vector.\")\n\n # ndims == 1\n return x"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef with_dependencies(dependencies, output_tensor, name=None):\n if tf.executing_eagerly():\n return output_tensor\n with tf.name_scope(name or \"control_dependency\") as name:\n with tf.control_dependencies(d for d in dependencies if d is not None):\n output_tensor = tf.convert_to_tensor(value=output_tensor)\n if isinstance(output_tensor, tf.Tensor):\n return 
tf.identity(output_tensor, name=name)\n else:\n return tf.IndexedSlices(\n tf.identity(output_tensor.values, name=name),\n output_tensor.indices,\n output_tensor.dense_shape)", "response": "Produces the content of output_tensor only after all operations in dependencies have run."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\ncheck that the rightmost transposed dims are valid.", "response": "def _maybe_validate_rightmost_transposed_ndims(\n rightmost_transposed_ndims, validate_args, name=None):\n \"\"\"Checks that `rightmost_transposed_ndims` is valid.\"\"\"\n with tf.name_scope(name or 'maybe_validate_rightmost_transposed_ndims'):\n assertions = []\n if not dtype_util.is_integer(rightmost_transposed_ndims.dtype):\n raise TypeError('`rightmost_transposed_ndims` must be integer type.')\n\n if tensorshape_util.rank(rightmost_transposed_ndims.shape) is not None:\n if tensorshape_util.rank(rightmost_transposed_ndims.shape) != 0:\n raise ValueError('`rightmost_transposed_ndims` must be a scalar, '\n 'saw rank: {}.'.format(\n tensorshape_util.rank(\n rightmost_transposed_ndims.shape)))\n elif validate_args:\n assertions += [assert_util.assert_rank(rightmost_transposed_ndims, 0)]\n\n rightmost_transposed_ndims_ = tf.get_static_value(\n rightmost_transposed_ndims)\n msg = '`rightmost_transposed_ndims` must be non-negative.'\n if rightmost_transposed_ndims_ is not None:\n if rightmost_transposed_ndims_ < 0:\n raise ValueError(msg[:-1] + ', saw: {}.'.format(\n rightmost_transposed_ndims_))\n elif validate_args:\n assertions += [\n assert_util.assert_non_negative(\n rightmost_transposed_ndims, message=msg)\n ]\n\n return assertions"} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef _maybe_validate_perm(perm, validate_args, name=None):\n with tf.name_scope(name or 'maybe_validate_perm'):\n assertions = []\n if not dtype_util.is_integer(perm.dtype):\n raise TypeError('`perm` must 
be integer type')\n\n msg = '`perm` must be a vector.'\n if tensorshape_util.rank(perm.shape) is not None:\n if tensorshape_util.rank(perm.shape) != 1:\n raise ValueError(\n msg[:-1] +\n ', saw rank: {}.'.format(tensorshape_util.rank(perm.shape)))\n elif validate_args:\n assertions += [assert_util.assert_rank(perm, 1, message=msg)]\n\n perm_ = tf.get_static_value(perm)\n msg = '`perm` must be a valid permutation vector.'\n if perm_ is not None:\n if not np.all(np.arange(np.size(perm_)) == np.sort(perm_)):\n raise ValueError(msg[:-1] + ', saw: {}.'.format(perm_))\n elif validate_args:\n assertions += [\n assert_util.assert_equal(\n tf.sort(perm), tf.range(tf.size(input=perm)), message=msg)\n ]\n\n return assertions", "response": "Checks that perm is valid."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef concatenate(x, other):\n return type(x)(tf.TensorShape(x).concatenate(other))", "response": "Returns the concatenation of the dimension in x and other."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef constant_value_as_shape(tensor): # pylint: disable=invalid-name\n shape = tf.get_static_value(tensor)\n if shape is not None:\n return [None if dim == -1 else dim for dim in shape]\n return tensor_util.constant_value_as_shape(tensor)", "response": "A version of constant_value that returns a TensorShape."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef dims(x):\n if isinstance(x, tf.TensorShape):\n return x.dims\n r = tf.TensorShape(x).dims\n return None if r is None else list(map(tf.compat.dimension_value, r))", "response": "Returns a list of dimensions for each a\n ."} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\nreturns a shape combining the information in x and other.", "response": "def merge_with(x, other):\n \"\"\"Returns a shape 
combining the information in `x` and `other`.\n\n The dimensions in `x` and `other` are merged elementwise, according to the\n rules defined for `tf.Dimension.merge_with()`.\n\n For more details, see `help(tf.TensorShape.merge_with)`.\n\n Args:\n x: object representing a shape; convertible to `tf.TensorShape`.\n other: object representing a shape; convertible to `tf.TensorShape`.\n\n Returns:\n merged_shape: shape having `type(x)` containing the combined information of\n `x` and `other`.\n\n Raises:\n ValueError: If `x` and `other` are not compatible.\n \"\"\"\n return type(x)(tf.TensorShape(x).merge_with(other))"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef with_rank_at_least(x, rank): # pylint: disable=redefined-outer-name\n return type(x)(tf.TensorShape(x).with_rank_at_least(rank))", "response": "Returns a shape based on x with at least the given rank."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef _check_equal_shape(name,\n static_shape,\n dynamic_shape,\n static_target_shape,\n dynamic_target_shape=None):\n \"\"\"Check that source and target shape match, statically if possible.\"\"\"\n\n static_target_shape = tf.TensorShape(static_target_shape)\n if tensorshape_util.is_fully_defined(\n static_shape) and tensorshape_util.is_fully_defined(static_target_shape):\n if static_shape != static_target_shape:\n raise ValueError(\"{}: required shape {} but found {}\".\n format(name, static_target_shape, static_shape))\n return None\n else:\n if dynamic_target_shape is None:\n if tensorshape_util.is_fully_defined(static_target_shape):\n dynamic_target_shape = tensorshape_util.as_list(static_target_shape)\n else:\n raise ValueError(\"{}: cannot infer target shape: no dynamic shape \"\n \"specified and static shape {} is not fully defined\".\n format(name, static_target_shape))\n return assert_util.assert_equal(\n dynamic_shape,\n 
dynamic_target_shape,\n message=(\"{}: required shape {}\".format(name, static_target_shape)))", "response": "Check that source and target shape match statically if possible."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\naugment a sample shape to broadcast batch dimensions.", "response": "def _augment_sample_shape(partial_batch_dist,\n full_sample_and_batch_shape,\n validate_args=False):\n \"\"\"Augment a sample shape to broadcast batch dimensions.\n\n Computes an augmented sample shape, so that any batch dimensions not\n part of the distribution `partial_batch_dist` are treated as identical\n distributions.\n\n # partial_batch_dist.batch_shape = [ 7]\n # full_sample_and_batch_shape = [3, 4, 7]\n # => return an augmented sample shape of [3, 4] so that\n # partial_batch_dist.sample(augmented_sample_shape) has combined\n # sample and batch shape of [3, 4, 7].\n\n Args:\n partial_batch_dist: `tfd.Distribution` instance with batch shape a\n prefix of `full_sample_and_batch_shape`.\n full_sample_and_batch_shape: a Tensor or Tensor-like shape.\n validate_args: if True, check for shape errors at runtime.\n Returns:\n augmented_sample_shape: sample shape such that\n `partial_batch_dist.sample(augmented_sample_shape)` has combined\n sample and batch shape of `full_sample_and_batch_shape`.\n\n Raises:\n ValueError: if `partial_batch_dist.batch_shape` has more dimensions than\n `full_sample_and_batch_shape`.\n NotImplementedError: if broadcasting would be required to make\n `partial_batch_dist.batch_shape` into a prefix of\n `full_sample_and_batch_shape` .\n \"\"\"\n full_ndims = distribution_util.prefer_static_shape(\n full_sample_and_batch_shape)[0]\n partial_batch_ndims = (\n tensorshape_util.rank(partial_batch_dist.batch_shape) # pylint: disable=g-long-ternary\n if tensorshape_util.rank(partial_batch_dist.batch_shape) is not None\n else distribution_util.prefer_static_shape(\n partial_batch_dist.batch_shape_tensor())[0])\n\n 
num_broadcast_dims = full_ndims - partial_batch_ndims\n\n expected_partial_batch_shape = (\n full_sample_and_batch_shape[num_broadcast_dims:])\n expected_partial_batch_shape_static = tf.get_static_value(\n full_sample_and_batch_shape[num_broadcast_dims:])\n\n # Raise errors statically if possible.\n num_broadcast_dims_static = tf.get_static_value(num_broadcast_dims)\n if num_broadcast_dims_static is not None:\n if num_broadcast_dims_static < 0:\n raise ValueError(\"Cannot broadcast distribution {} batch shape to \"\n \"target batch shape with fewer dimensions\"\n .format(partial_batch_dist))\n if (expected_partial_batch_shape_static is not None and\n tensorshape_util.is_fully_defined(partial_batch_dist.batch_shape)):\n if (partial_batch_dist.batch_shape and\n any(expected_partial_batch_shape_static != tensorshape_util.as_list(\n partial_batch_dist.batch_shape))):\n raise NotImplementedError(\"Broadcasting is not supported; \"\n \"unexpected batch shape \"\n \"(expected {}, saw {}).\".format(\n expected_partial_batch_shape_static,\n partial_batch_dist.batch_shape\n ))\n runtime_assertions = []\n if validate_args:\n runtime_assertions.append(\n assert_util.assert_greater_equal(\n tf.convert_to_tensor(value=num_broadcast_dims, dtype=tf.int32),\n tf.zeros((), dtype=tf.int32),\n message=(\"Cannot broadcast distribution {} batch shape to \"\n \"target batch shape with fewer dimensions.\".format(\n partial_batch_dist))))\n runtime_assertions.append(\n assert_util.assert_equal(\n expected_partial_batch_shape,\n partial_batch_dist.batch_shape_tensor(),\n message=(\"Broadcasting is not supported; \"\n \"unexpected batch shape.\"),\n name=\"assert_batch_shape_same\"))\n\n with tf.control_dependencies(runtime_assertions):\n return full_sample_and_batch_shape[:num_broadcast_dims]"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef build_backward_pass_step(get_transition_matrix_for_timestep):\n\n def 
backward_pass_step(state,\n filtered_parameters):\n \"\"\"Run a single step of backward smoothing.\"\"\"\n\n (filtered_mean, filtered_cov,\n predicted_mean, predicted_cov) = filtered_parameters\n transition_matrix = get_transition_matrix_for_timestep(state.timestep)\n\n next_posterior_mean = state.backward_mean\n next_posterior_cov = state.backward_cov\n\n posterior_mean, posterior_cov = backward_smoothing_update(\n filtered_mean,\n filtered_cov,\n predicted_mean,\n predicted_cov,\n next_posterior_mean,\n next_posterior_cov,\n transition_matrix)\n\n return BackwardPassState(backward_mean=posterior_mean,\n backward_cov=posterior_cov,\n timestep=state.timestep-1)\n\n return backward_pass_step", "response": "Builds a function that performs one step for backward smoothing."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef backward_smoothing_update(filtered_mean,\n filtered_cov,\n predicted_mean,\n predicted_cov,\n next_posterior_mean,\n next_posterior_cov,\n transition_matrix):\n \"\"\"Backward update for a Kalman smoother.\n\n Give the `filtered_mean` mu(t | t), `filtered_cov` sigma(t | t),\n `predicted_mean` mu(t+1 | t) and `predicted_cov` sigma(t+1 | t),\n as returns from the `forward_filter` function, as well as\n `next_posterior_mean` mu(t+1 | 1:T) and `next_posterior_cov` sigma(t+1 | 1:T),\n if the `transition_matrix` of states from time t to time t+1\n is given as A(t+1), the 1 step backward smoothed distribution parameter\n could be calculated as:\n p(z(t) | Obs(1:T)) = N( mu(t | 1:T), sigma(t | 1:T)),\n mu(t | 1:T) = mu(t | t) + J(t) * (mu(t+1 | 1:T) - mu(t+1 | t)),\n sigma(t | 1:T) = sigma(t | t)\n + J(t) * (sigma(t+1 | 1:T) - sigma(t+1 | t) * J(t)',\n J(t) = sigma(t | t) * A(t+1)' / sigma(t+1 | t),\n where all the multiplications are matrix multiplication, and `/` is\n the matrix inverse. 
J(t) is the backward Kalman gain matrix.\n\n The algorithm can be intialized from mu(T | 1:T) and sigma(T | 1:T),\n which are the last step parameters returned by forward_filter.\n\n\n Args:\n filtered_mean: `Tensor` with event shape `[latent_size, 1]` and\n batch shape `B`, containing mu(t | t).\n filtered_cov: `Tensor` with event shape `[latent_size, latent_size]` and\n batch shape `B`, containing sigma(t | t).\n predicted_mean: `Tensor` with event shape `[latent_size, 1]` and\n batch shape `B`, containing mu(t+1 | t).\n predicted_cov: `Tensor` with event shape `[latent_size, latent_size]` and\n batch shape `B`, containing sigma(t+1 | t).\n next_posterior_mean: `Tensor` with event shape `[latent_size, 1]` and\n batch shape `B`, containing mu(t+1 | 1:T).\n next_posterior_cov: `Tensor` with event shape `[latent_size, latent_size]`\n and batch shape `B`, containing sigma(t+1 | 1:T).\n transition_matrix: `LinearOperator` with shape\n `[latent_size, latent_size]` and batch shape broadcastable\n to `B`.\n\n Returns:\n posterior_mean: `Tensor` with event shape `[latent_size, 1]` and\n batch shape `B`, containing mu(t | 1:T).\n posterior_cov: `Tensor` with event shape `[latent_size, latent_size]` and\n batch shape `B`, containing sigma(t | 1:T).\n \"\"\"\n # Compute backward Kalman gain:\n # J = F * T' * P^{-1}\n # Since both F(iltered) and P(redictive) are cov matrices,\n # thus self-adjoint, we can take the transpose.\n # computation:\n # = (P^{-1} * T * F)'\n # = (P^{-1} * tmp_gain_cov) '\n # = (P \\ tmp_gain_cov)'\n tmp_gain_cov = transition_matrix.matmul(filtered_cov)\n predicted_cov_chol = tf.linalg.cholesky(predicted_cov)\n gain_transpose = tf.linalg.cholesky_solve(predicted_cov_chol, tmp_gain_cov)\n\n posterior_mean = (filtered_mean +\n tf.linalg.matmul(gain_transpose,\n next_posterior_mean - predicted_mean,\n adjoint_a=True))\n posterior_cov = (\n filtered_cov +\n tf.linalg.matmul(gain_transpose,\n tf.linalg.matmul(\n next_posterior_cov - predicted_cov, 
gain_transpose),\n adjoint_a=True))\n\n return (posterior_mean, posterior_cov)", "response": "This function is used to update the backward smoothing of the Kalman distribution."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef build_kalman_filter_step(get_transition_matrix_for_timestep,\n get_transition_noise_for_timestep,\n get_observation_matrix_for_timestep,\n get_observation_noise_for_timestep):\n \"\"\"Build a callable that performs one step of Kalman filtering.\n\n Args:\n get_transition_matrix_for_timestep: callable taking a timestep\n as an integer `Tensor` argument, and returning a `LinearOperator`\n of shape `[latent_size, latent_size]`.\n get_transition_noise_for_timestep: callable taking a timestep as\n an integer `Tensor` argument, and returning a\n `MultivariateNormalLinearOperator` of event shape\n `[latent_size]`.\n get_observation_matrix_for_timestep: callable taking a timestep\n as an integer `Tensor` argument, and returning a `LinearOperator`\n of shape `[observation_size, observation_size]`.\n get_observation_noise_for_timestep: callable taking a timestep as\n an integer `Tensor` argument, and returning a\n `MultivariateNormalLinearOperator` of event shape\n `[observation_size]`.\n\n Returns:\n kalman_filter_step: a callable that updates a KalmanFilterState\n from timestep `t-1` to `t`.\n \"\"\"\n\n def kalman_filter_step(state, elems_t):\n \"\"\"Run a single step of Kalman filtering.\n\n Args:\n state: A `KalmanFilterState` object representing the previous\n filter state at time `t-1`.\n elems_t: A tuple of Tensors `(x_t, mask_t)`, or a `Tensor` `x_t`.\n `x_t` is a `Tensor` with rightmost shape dimensions\n `[observation_size, 1]` representing the vector observed at time `t`,\n and `mask_t` is a `Tensor` with rightmost dimensions`[1, 1]`\n representing the observation mask at time `t`. 
Both `x_t` and `mask_t`\n may have batch dimensions, which must be compatible with the batch\n dimensions of `state.predicted_mean` and `state.predicted_cov`\n respectively. If `mask_t` is not provided, it is assumed to be `None`.\n\n Returns:\n new_state: A `KalmanFilterState` object representing the new\n filter state at time `t`.\n \"\"\"\n\n if isinstance(elems_t, tuple):\n x_t, mask_t = elems_t\n else:\n x_t = elems_t\n mask_t = None\n\n observation_matrix = get_observation_matrix_for_timestep(state.timestep)\n observation_noise = get_observation_noise_for_timestep(state.timestep)\n if mask_t is not None:\n # Before running the update, fill in masked observations using the prior\n # expectation. The precise filled value shouldn't matter since updates\n # from masked elements will not be selected below, but we need to ensure\n # that any results we incidentally compute on masked values are at least\n # finite (not inf or NaN) so that they don't screw up gradient propagation\n # through `tf.where`, as described in\n # https://github.com/tensorflow/tensorflow/issues/2540.\n # We fill with the prior expectation because any fixed value such as zero\n # might be arbitrarily unlikely under the prior, leading to overflow in\n # the updates, but the prior expectation should always be a\n # 'reasonable' observation.\n x_expected = _propagate_mean(state.predicted_mean,\n observation_matrix,\n observation_noise) * tf.ones_like(x_t)\n x_t = tf.where(\n tf.broadcast_to(mask_t, tf.shape(input=x_expected)), x_expected,\n tf.broadcast_to(x_t, tf.shape(input=x_expected)))\n\n # Given predicted mean u_{t|t-1} and covariance P_{t|t-1} from the\n # previous step, incorporate the observation x_t, producing the\n # filtered mean u_t and covariance P_t.\n (filtered_mean,\n filtered_cov,\n observation_dist) = linear_gaussian_update(\n state.predicted_mean, state.predicted_cov,\n observation_matrix, observation_noise,\n x_t)\n\n # Compute the marginal likelihood p(x_{t} | x_{:t-1}) for 
this\n # observation.\n log_marginal_likelihood = observation_dist.log_prob(x_t[..., 0])\n\n if mask_t is not None:\n filtered_mean = tf.where(\n tf.broadcast_to(mask_t, tf.shape(input=filtered_mean)),\n state.predicted_mean, filtered_mean)\n filtered_cov = tf.where(\n tf.broadcast_to(mask_t, tf.shape(input=filtered_cov)),\n state.predicted_cov, filtered_cov)\n log_marginal_likelihood = tf.where(\n tf.broadcast_to(mask_t[..., 0, 0],\n tf.shape(input=log_marginal_likelihood)),\n tf.zeros_like(log_marginal_likelihood),\n log_marginal_likelihood)\n\n # Run the filtered posterior through the transition\n # model to predict the next time step:\n # u_{t|t-1} = F_t u_{t-1} + b_t\n # P_{t|t-1} = F_t P_{t-1} F_t' + Q_t\n predicted_mean, predicted_cov = kalman_transition(\n filtered_mean,\n filtered_cov,\n get_transition_matrix_for_timestep(state.timestep),\n get_transition_noise_for_timestep(state.timestep))\n\n return KalmanFilterState(\n filtered_mean, filtered_cov,\n predicted_mean, predicted_cov,\n observation_dist.mean()[..., tf.newaxis],\n observation_dist.covariance(),\n log_marginal_likelihood,\n state.timestep+1)\n\n return kalman_filter_step", "response": "Build a Kalman filter step."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\nconjugating update for a linear Gaussian model.", "response": "def linear_gaussian_update(\n prior_mean, prior_cov, observation_matrix, observation_noise, x_observed):\n \"\"\"Conjugate update for a linear Gaussian model.\n\n Given a normal prior on a latent variable `z`,\n `p(z) = N(prior_mean, prior_cov) = N(u, P)`,\n for which we observe a linear Gaussian transformation `x`,\n `p(x|z) = N(H * z + c, R)`,\n the posterior is also normal:\n `p(z|x) = N(u*, P*)`.\n\n We can write this update as\n x_expected = H * u + c # pushforward prior mean\n S = R + H * P * H' # pushforward prior cov\n K = P * H' * S^{-1} # optimal Kalman gain\n u* = u + K * (x_observed - x_expected) # posterior mean\n P* = (I - K * H) * P 
(I - K * H)' + K * R * K' # posterior cov\n (see, e.g., https://en.wikipedia.org/wiki/Kalman_filter#Update)\n\n Args:\n prior_mean: `Tensor` with event shape `[latent_size, 1]` and\n potential batch shape `B = [b1, ..., b_n]`.\n prior_cov: `Tensor` with event shape `[latent_size, latent_size]`\n and batch shape `B` (matching `prior_mean`).\n observation_matrix: `LinearOperator` with shape\n `[observation_size, latent_size]` and batch shape broadcastable\n to `B`.\n observation_noise: potentially-batched\n `MultivariateNormalLinearOperator` instance with event shape\n `[observation_size]` and batch shape broadcastable to `B`.\n x_observed: potentially batched `Tensor` with event shape\n `[observation_size, 1]` and batch shape `B`.\n\n Returns:\n posterior_mean: `Tensor` with event shape `[latent_size, 1]` and\n batch shape `B`.\n posterior_cov: `Tensor` with event shape `[latent_size,\n latent_size]` and batch shape `B`.\n predictive_dist: the prior predictive distribution `p(x|z)`,\n as a `Distribution` instance with event\n shape `[observation_size]` and batch shape `B`. 
This will\n typically be `tfd.MultivariateNormalTriL`, but when\n `observation_size=1` we return a `tfd.Independent(tfd.Normal)`\n instance as an optimization.\n \"\"\"\n\n # If observations are scalar, we can avoid some matrix ops.\n observation_size_is_static_and_scalar = (\n tf.compat.dimension_value(observation_matrix.shape[-2]) == 1)\n\n # Push the predicted mean for the latent state through the\n # observation model\n x_expected = _propagate_mean(prior_mean,\n observation_matrix,\n observation_noise)\n\n # Push the predictive covariance of the latent state through the\n # observation model:\n # S = R + H * P * H'.\n # We use a temporary variable for H * P,\n # reused below to compute Kalman gain.\n tmp_obs_cov = observation_matrix.matmul(prior_cov)\n predicted_obs_cov = (\n observation_matrix.matmul(tmp_obs_cov, adjoint_arg=True)\n + observation_noise.covariance())\n\n # Compute optimal Kalman gain:\n # K = P * H' * S^{-1}\n # Since both S and P are cov matrices, thus symmetric,\n # we can take the transpose and reuse our previous\n # computation:\n # = (S^{-1} * H * P)'\n # = (S^{-1} * tmp_obs_cov) '\n # = (S \\ tmp_obs_cov)'\n if observation_size_is_static_and_scalar:\n gain_transpose = tmp_obs_cov/predicted_obs_cov\n else:\n predicted_obs_cov_chol = tf.linalg.cholesky(predicted_obs_cov)\n gain_transpose = tf.linalg.cholesky_solve(predicted_obs_cov_chol,\n tmp_obs_cov)\n\n # Compute the posterior mean, incorporating the observation.\n # u* = u + K (x_observed - x_expected)\n posterior_mean = (prior_mean +\n tf.linalg.matmul(gain_transpose, x_observed - x_expected,\n adjoint_a=True))\n\n # For the posterior covariance, we could use the simple update\n # P* = P - K * H * P\n # but this is prone to numerical issues because it subtracts a\n # value from a PSD matrix. We choose instead to use the more\n # expensive Jordan form update\n # P* = (I - K H) * P * (I - K H)' + K R K'\n # which always produces a PSD result. 
This uses\n # tmp_term = (I - K * H)'\n # as an intermediate quantity.\n tmp_term = -observation_matrix.matmul(gain_transpose, adjoint=True) # -K * H\n tmp_term = tf.linalg.set_diag(tmp_term, tf.linalg.diag_part(tmp_term) + 1)\n posterior_cov = (\n tf.linalg.matmul(\n tmp_term, tf.linalg.matmul(prior_cov, tmp_term), adjoint_a=True)\n + tf.linalg.matmul(gain_transpose,\n tf.linalg.matmul(\n observation_noise.covariance(), gain_transpose),\n adjoint_a=True))\n\n if observation_size_is_static_and_scalar:\n # A plain Normal would have event shape `[]`; wrapping with Independent\n # ensures `event_shape=[1]` as required.\n predictive_dist = independent.Independent(\n normal.Normal(loc=x_expected[..., 0],\n scale=tf.sqrt(predicted_obs_cov[..., 0])),\n reinterpreted_batch_ndims=1)\n\n # Minor hack to define the covariance, so that `predictive_dist` can pass as\n # an MVNTriL-like object.\n predictive_dist.covariance = lambda: predicted_obs_cov\n else:\n predictive_dist = mvn_tril.MultivariateNormalTriL(\n loc=x_expected[..., 0],\n scale_tril=predicted_obs_cov_chol)\n\n return posterior_mean, posterior_cov, predictive_dist"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef kalman_transition(filtered_mean, filtered_cov,\n transition_matrix, transition_noise):\n \"\"\"Propagate a filtered distribution through a transition model.\"\"\"\n\n predicted_mean = _propagate_mean(filtered_mean,\n transition_matrix,\n transition_noise)\n predicted_cov = _propagate_cov(filtered_cov,\n transition_matrix,\n transition_noise)\n return predicted_mean, predicted_cov", "response": "Propagate a filtered distribution through a transition model."} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef build_kalman_mean_step(get_transition_matrix_for_timestep,\n get_transition_noise_for_timestep,\n get_observation_matrix_for_timestep,\n get_observation_noise_for_timestep):\n \"\"\"Build a 
callable that performs one step of Kalman mean recursion.\n\n Args:\n get_transition_matrix_for_timestep: callable taking a timestep\n as an integer `Tensor` argument, and returning a `LinearOperator`\n of shape `[latent_size, latent_size]`.\n get_transition_noise_for_timestep: callable taking a timestep as\n an integer `Tensor` argument, and returning a\n `MultivariateNormalLinearOperator` of event shape\n `[latent_size]`.\n get_observation_matrix_for_timestep: callable taking a timestep\n as an integer `Tensor` argument, and returning a `LinearOperator`\n of shape `[observation_size, observation_size]`.\n get_observation_noise_for_timestep: callable taking a timestep as\n an integer `Tensor` argument, and returning a\n `MultivariateNormalLinearOperator` of event shape\n `[observation_size]`.\n\n Returns:\n kalman_mean_step: a callable that computes latent state and\n observation means at time `t`, given latent mean at time `t-1`.\n \"\"\"\n\n def mean_step(previous_means, t):\n \"\"\"Single step of prior mean recursion.\"\"\"\n previous_latent_mean, _ = previous_means\n\n latent_mean = _propagate_mean(previous_latent_mean,\n get_transition_matrix_for_timestep(t - 1),\n get_transition_noise_for_timestep(t - 1))\n observation_mean = _propagate_mean(latent_mean,\n get_observation_matrix_for_timestep(t),\n get_observation_noise_for_timestep(t))\n return (latent_mean, observation_mean)\n\n return mean_step", "response": "Builds a kalman mean step."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\nbuilding a callable that computes the latent state and observation covariance at a given timestep.", "response": "def build_kalman_cov_step(get_transition_matrix_for_timestep,\n get_transition_noise_for_timestep,\n get_observation_matrix_for_timestep,\n get_observation_noise_for_timestep):\n \"\"\"Build a callable for one step of Kalman covariance recursion.\n\n Args:\n get_transition_matrix_for_timestep: callable taking a timestep\n as an integer 
`Tensor` argument, and returning a `LinearOperator`\n of shape `[latent_size, latent_size]`.\n get_transition_noise_for_timestep: callable taking a timestep as\n an integer `Tensor` argument, and returning a\n `MultivariateNormalLinearOperator` of event shape\n `[latent_size]`.\n get_observation_matrix_for_timestep: callable taking a timestep\n as an integer `Tensor` argument, and returning a `LinearOperator`\n of shape `[observation_size, observation_size]`.\n get_observation_noise_for_timestep: callable taking a timestep as\n an integer `Tensor` argument, and returning a\n `MultivariateNormalLinearOperator` of event shape\n `[observation_size]`.\n\n Returns:\n cov_step: a callable that computes latent state and observation\n covariance at time `t`, given latent covariance at time `t-1`.\n \"\"\"\n\n def cov_step(previous_covs, t):\n \"\"\"Single step of prior covariance recursion.\"\"\"\n previous_latent_cov, _ = previous_covs\n\n latent_cov = _propagate_cov(\n previous_latent_cov,\n get_transition_matrix_for_timestep(t - 1),\n get_transition_noise_for_timestep(t - 1))\n observation_cov = _propagate_cov(\n latent_cov,\n get_observation_matrix_for_timestep(t),\n get_observation_noise_for_timestep(t))\n\n return (latent_cov, observation_cov)\n\n return cov_step"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef build_kalman_sample_step(get_transition_matrix_for_timestep,\n get_transition_noise_for_timestep,\n get_observation_matrix_for_timestep,\n get_observation_noise_for_timestep,\n full_sample_and_batch_shape,\n stream,\n validate_args=False):\n \"\"\"Build a callable for one step of Kalman sampling recursion.\n\n Args:\n get_transition_matrix_for_timestep: callable taking a timestep\n as an integer `Tensor` argument, and returning a `LinearOperator`\n of shape `[latent_size, latent_size]`.\n get_transition_noise_for_timestep: callable taking a timestep as\n an integer `Tensor` argument, and returning a\n 
`MultivariateNormalLinearOperator` of event shape\n `[latent_size]`.\n get_observation_matrix_for_timestep: callable taking a timestep\n as an integer `Tensor` argument, and returning a `LinearOperator`\n of shape `[observation_size, observation_size]`.\n get_observation_noise_for_timestep: callable taking a timestep as\n an integer `Tensor` argument, and returning a\n `MultivariateNormalLinearOperator` of event shape\n `[observation_size]`.\n full_sample_and_batch_shape: Desired sample and batch shape of the\n returned samples, concatenated in a single `Tensor`.\n stream: `tfd.SeedStream` instance used to generate a\n sequence of random seeds.\n validate_args: if True, perform error checking at runtime.\n\n Returns:\n sample_step: a callable that samples the latent state and\n observation at time `t`, given latent state at time `t-1`.\n \"\"\"\n\n def sample_step(sampled_prev, t):\n \"\"\"Sample values for a single timestep.\"\"\"\n latent_prev, _ = sampled_prev\n\n transition_matrix = get_transition_matrix_for_timestep(t - 1)\n transition_noise = get_transition_noise_for_timestep(t - 1)\n\n latent_pred = transition_matrix.matmul(latent_prev)\n latent_sampled = latent_pred + transition_noise.sample(\n sample_shape=_augment_sample_shape(\n transition_noise,\n full_sample_and_batch_shape,\n validate_args),\n seed=stream())[..., tf.newaxis]\n\n observation_matrix = get_observation_matrix_for_timestep(t)\n observation_noise = get_observation_noise_for_timestep(t)\n\n observation_pred = observation_matrix.matmul(latent_sampled)\n observation_sampled = observation_pred + observation_noise.sample(\n sample_shape=_augment_sample_shape(\n observation_noise,\n full_sample_and_batch_shape,\n validate_args),\n seed=stream())[..., tf.newaxis]\n\n return (latent_sampled, observation_sampled)\n\n return sample_step", "response": "Builds a callable that samples the latent state and observation at a single timestep."} {"SOURCE": "codesearchnet", "instruction": "Make a summary of 
the following Python 3 code\ndef build_pushforward_latents_step(get_observation_matrix_for_timestep,\n get_observation_noise_for_timestep):\n \"\"\"Build a callable to push latent means/covs to observed means/covs.\n\n Args:\n get_observation_matrix_for_timestep: callable taking a timestep\n as an integer `Tensor` argument, and returning a `LinearOperator`\n of shape `[observation_size, observation_size]`.\n get_observation_noise_for_timestep: callable taking a timestep as\n an integer `Tensor` argument, and returning a\n `MultivariateNormalLinearOperator` of event shape\n `[observation_size]`.\n\n Returns:\n pushforward_latents_step: a callable that computes the observation mean and\n covariance at time `t`, given latent mean and covariance at time `t`.\n \"\"\"\n\n def pushforward_latents_step(_, latent_t_mean_cov):\n \"\"\"Loop body fn to pushforward latents to observations at a time step.\"\"\"\n t, latent_mean, latent_cov = latent_t_mean_cov\n\n observation_matrix = get_observation_matrix_for_timestep(t)\n observation_noise = get_observation_noise_for_timestep(t)\n observation_mean = _propagate_mean(latent_mean,\n observation_matrix,\n observation_noise)\n observation_cov = _propagate_cov(latent_cov,\n observation_matrix,\n observation_noise)\n\n return (observation_mean, observation_cov)\n\n return pushforward_latents_step", "response": "Builds a function that computes the observation mean and covariance at a time step."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef _propagate_mean(mean, linop, dist):\n return linop.matmul(mean) + dist.mean()[..., tf.newaxis]", "response": "Propagate a mean through linear Gaussian transformation."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _propagate_cov(cov, linop, dist):\n # For linop A and input cov P, returns `A P A' + dist.cov()`\n return linop.matmul(linop.matmul(cov), adjoint_arg=True) + 
dist.covariance()", "response": "Propagate covariance through linear Gaussian transformation."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function for\nrunning the backward pass in Kalman smoother. The backward smoothing is using Rauch, Tung and Striebel smoother as as discussed in section 18.3.2 of Kevin P. Murphy, 2012, Machine Learning: A Probabilistic Perspective, The MIT Press. The inputs are returned by `forward_filter` function. Args: filtered_means: Means of the per-timestep filtered marginal distributions p(z_t | x_{:t}), as a Tensor of shape `sample_shape(x) + batch_shape + [num_timesteps, latent_size]`. filtered_covs: Covariances of the per-timestep filtered marginal distributions p(z_t | x_{:t}), as a Tensor of shape `batch_shape + [num_timesteps, latent_size, latent_size]`. predicted_means: Means of the per-timestep predictive distributions over latent states, p(z_{t+1} | x_{:t}), as a Tensor of shape `sample_shape(x) + batch_shape + [num_timesteps, latent_size]`. predicted_covs: Covariances of the per-timestep predictive distributions over latent states, p(z_{t+1} | x_{:t}), as a Tensor of shape `batch_shape + [num_timesteps, latent_size, latent_size]`. Returns: posterior_means: Means of the smoothed marginal distributions p(z_t | x_{1:T}), as a Tensor of shape `sample_shape(x) + batch_shape + [num_timesteps, latent_size]`, which is of the same shape as filtered_means. posterior_covs: Covariances of the smoothed marginal distributions p(z_t | x_{1:T}), as a Tensor of shape `batch_shape + [num_timesteps, latent_size, latent_size]`. which is of the same shape as filtered_covs.", "response": "def backward_smoothing_pass(self,\n filtered_means,\n filtered_covs,\n predicted_means,\n predicted_covs):\n \"\"\"Run the backward pass in Kalman smoother.\n\n The backward smoothing is using Rauch, Tung and Striebel smoother as\n as discussed in section 18.3.2 of Kevin P. 
Murphy, 2012, Machine Learning:\n A Probabilistic Perspective, The MIT Press. The inputs are returned by\n `forward_filter` function.\n\n Args:\n filtered_means: Means of the per-timestep filtered marginal\n distributions p(z_t | x_{:t}), as a Tensor of shape\n `sample_shape(x) + batch_shape + [num_timesteps, latent_size]`.\n filtered_covs: Covariances of the per-timestep filtered marginal\n distributions p(z_t | x_{:t}), as a Tensor of shape\n `batch_shape + [num_timesteps, latent_size, latent_size]`.\n predicted_means: Means of the per-timestep predictive\n distributions over latent states, p(z_{t+1} | x_{:t}), as a\n Tensor of shape `sample_shape(x) + batch_shape +\n [num_timesteps, latent_size]`.\n predicted_covs: Covariances of the per-timestep predictive\n distributions over latent states, p(z_{t+1} | x_{:t}), as a\n Tensor of shape `batch_shape + [num_timesteps, latent_size,\n latent_size]`.\n\n Returns:\n posterior_means: Means of the smoothed marginal distributions\n p(z_t | x_{1:T}), as a Tensor of shape\n `sample_shape(x) + batch_shape + [num_timesteps, latent_size]`,\n which is of the same shape as filtered_means.\n posterior_covs: Covariances of the smoothed marginal distributions\n p(z_t | x_{1:T}), as a Tensor of shape\n `batch_shape + [num_timesteps, latent_size, latent_size]`.\n which is of the same shape as filtered_covs.\n \"\"\"\n with tf.name_scope(\"backward_pass\"):\n filtered_means = tf.convert_to_tensor(\n value=filtered_means, name=\"filtered_means\")\n filtered_covs = tf.convert_to_tensor(\n value=filtered_covs, name=\"filtered_covs\")\n predicted_means = tf.convert_to_tensor(\n value=predicted_means, name=\"predicted_means\")\n predicted_covs = tf.convert_to_tensor(\n value=predicted_covs, name=\"predicted_covs\")\n\n # To scan over time dimension, we need to move 'num_timesteps' from the\n # event shape to the initial dimension of the tensor.\n filtered_means = distribution_util.move_dimension(filtered_means, -2, 0)\n filtered_covs = 
distribution_util.move_dimension(filtered_covs, -3, 0)\n predicted_means = distribution_util.move_dimension(predicted_means, -2, 0)\n predicted_covs = distribution_util.move_dimension(predicted_covs, -3, 0)\n\n # The means are assumed to be vectors. Adding a dummy index to\n # ensure the `matmul` op working smoothly.\n filtered_means = filtered_means[..., tf.newaxis]\n predicted_means = predicted_means[..., tf.newaxis]\n\n initial_backward_mean = predicted_means[-1, ...]\n initial_backward_cov = predicted_covs[-1, ...]\n\n num_timesteps = tf.shape(input=filtered_means)[0]\n initial_state = BackwardPassState(\n backward_mean=initial_backward_mean,\n backward_cov=initial_backward_cov,\n timestep=self.initial_step + num_timesteps - 1)\n\n update_step_fn = build_backward_pass_step(\n self.get_transition_matrix_for_timestep)\n\n # For backward pass, it scans the `elems` from last to first.\n posterior_states = tf.scan(update_step_fn,\n elems=(filtered_means,\n filtered_covs,\n predicted_means,\n predicted_covs),\n initializer=initial_state,\n reverse=True)\n\n # Move the time dimension back into the event shape.\n posterior_means = distribution_util.move_dimension(\n posterior_states.backward_mean[..., 0], 0, -2)\n posterior_covs = distribution_util.move_dimension(\n posterior_states.backward_cov, 0, -3)\n\n return (posterior_means, posterior_covs)"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _joint_sample_n(self, n, seed=None):\n\n with tf.name_scope(\"sample_n_joint\"):\n stream = seed_stream.SeedStream(\n seed, salt=\"LinearGaussianStateSpaceModel_sample_n_joint\")\n\n sample_and_batch_shape = distribution_util.prefer_static_value(\n tf.concat([[n], self.batch_shape_tensor()],\n axis=0))\n\n # Sample the initial timestep from the prior. 
Since we want\n # this sample to have full batch shape (not just the batch shape\n # of the self.initial_state_prior object which might in general be\n # smaller), we augment the sample shape to include whatever\n # extra batch dimensions are required.\n with tf.control_dependencies(self.runtime_assertions):\n initial_latent = self.initial_state_prior.sample(\n sample_shape=_augment_sample_shape(\n self.initial_state_prior,\n sample_and_batch_shape,\n self.validate_args),\n seed=stream())\n\n # Add a dummy dimension so that matmul() does matrix-vector\n # multiplication.\n initial_latent = initial_latent[..., tf.newaxis]\n\n initial_observation_matrix = (\n self.get_observation_matrix_for_timestep(self.initial_step))\n initial_observation_noise = (\n self.get_observation_noise_for_timestep(self.initial_step))\n\n initial_observation_pred = initial_observation_matrix.matmul(\n initial_latent)\n initial_observation = (initial_observation_pred +\n initial_observation_noise.sample(\n sample_shape=_augment_sample_shape(\n initial_observation_noise,\n sample_and_batch_shape,\n self.validate_args),\n seed=stream())[..., tf.newaxis])\n\n sample_step = build_kalman_sample_step(\n self.get_transition_matrix_for_timestep,\n self.get_transition_noise_for_timestep,\n self.get_observation_matrix_for_timestep,\n self.get_observation_noise_for_timestep,\n full_sample_and_batch_shape=sample_and_batch_shape,\n stream=stream,\n validate_args=self.validate_args)\n\n # Scan over all timesteps to sample latents and observations.\n (latents, observations) = tf.scan(\n sample_step,\n elems=tf.range(self.initial_step+1, self.final_step),\n initializer=(initial_latent, initial_observation))\n\n # Combine the initial sampled timestep with the remaining timesteps.\n latents = tf.concat([initial_latent[tf.newaxis, ...],\n latents], axis=0)\n observations = tf.concat([initial_observation[tf.newaxis, ...],\n observations], axis=0)\n\n # Put dimensions back in order. 
The samples we've computed are\n # ordered by timestep, with shape `[num_timesteps, num_samples,\n # batch_shape, size, 1]` where `size` represents `latent_size`\n # or `observation_size` respectively. But timesteps are really\n # part of each probabilistic event, so we need to return a Tensor\n # of shape `[num_samples, batch_shape, num_timesteps, size]`.\n latents = tf.squeeze(latents, -1)\n latents = distribution_util.move_dimension(latents, 0, -2)\n observations = tf.squeeze(observations, -1)\n observations = distribution_util.move_dimension(observations, 0, -2)\n\n return latents, observations", "response": "Draw a joint sample from the prior over latents and observations."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\nrunning a Kalman filter over a provided sequence of outputs.", "response": "def forward_filter(self, x, mask=None):\n \"\"\"Run a Kalman filter over a provided sequence of outputs.\n\n Note that the returned values `filtered_means`, `predicted_means`, and\n `observation_means` depend on the observed time series `x`, while the\n corresponding covariances are independent of the observed series; i.e., they\n depend only on the model itself. This means that the mean values have shape\n `concat([sample_shape(x), batch_shape, [num_timesteps,\n {latent/observation}_size]])`, while the covariances have shape\n `concat[(batch_shape, [num_timesteps, {latent/observation}_size,\n {latent/observation}_size]])`, which does not depend on the sample shape.\n\n Args:\n x: a float-type `Tensor` with rightmost dimensions\n `[num_timesteps, observation_size]` matching\n `self.event_shape`. Additional dimensions must match or be\n broadcastable to `self.batch_shape`; any further dimensions\n are interpreted as a sample shape.\n mask: optional bool-type `Tensor` with rightmost dimension\n `[num_timesteps]`; `True` values specify that the value of `x`\n at that timestep is masked, i.e., not conditioned on. 
Additional\n dimensions must match or be broadcastable to `self.batch_shape`; any\n further dimensions must match or be broadcastable to the sample\n shape of `x`.\n Default value: `None`.\n\n Returns:\n log_likelihoods: Per-timestep log marginal likelihoods `log\n p(x_t | x_{:t-1})` evaluated at the input `x`, as a `Tensor`\n of shape `sample_shape(x) + batch_shape + [num_timesteps].`\n filtered_means: Means of the per-timestep filtered marginal\n distributions p(z_t | x_{:t}), as a Tensor of shape\n `sample_shape(x) + batch_shape + [num_timesteps, latent_size]`.\n filtered_covs: Covariances of the per-timestep filtered marginal\n distributions p(z_t | x_{:t}), as a Tensor of shape\n `sample_shape(mask) + batch_shape + [num_timesteps, latent_size,\n latent_size]`. Note that the covariances depend only on the model and\n the mask, not on the data, so this may have fewer dimensions than\n `filtered_means`.\n predicted_means: Means of the per-timestep predictive\n distributions over latent states, p(z_{t+1} | x_{:t}), as a\n Tensor of shape `sample_shape(x) + batch_shape +\n [num_timesteps, latent_size]`.\n predicted_covs: Covariances of the per-timestep predictive\n distributions over latent states, p(z_{t+1} | x_{:t}), as a\n Tensor of shape `sample_shape(mask) + batch_shape +\n [num_timesteps, latent_size, latent_size]`. Note that the covariances\n depend only on the model and the mask, not on the data, so this may\n have fewer dimensions than `predicted_means`.\n observation_means: Means of the per-timestep predictive\n distributions over observations, p(x_{t} | x_{:t-1}), as a\n Tensor of shape `sample_shape(x) + batch_shape +\n [num_timesteps, observation_size]`.\n observation_covs: Covariances of the per-timestep predictive\n distributions over observations, p(x_{t} | x_{:t-1}), as a\n Tensor of shape `sample_shape(mask) + batch_shape + [num_timesteps,\n observation_size, observation_size]`. 
Note that the covariances depend\n only on the model and the mask, not on the data, so this may have fewer\n dimensions than `observation_means`.\n \"\"\"\n\n with tf.name_scope(\"forward_filter\"):\n x = tf.convert_to_tensor(value=x, name=\"x\")\n if mask is not None:\n mask = tf.convert_to_tensor(value=mask, name=\"mask\", dtype_hint=tf.bool)\n\n # Check event shape statically if possible\n check_x_shape_op = _check_equal_shape(\n \"x\", x.shape[-2:], tf.shape(input=x)[-2:],\n self.event_shape, self.event_shape_tensor())\n check_mask_dims_op = None\n check_mask_shape_op = None\n if mask is not None:\n if (tensorshape_util.rank(mask.shape) is None or\n tensorshape_util.rank(x.shape) is None):\n check_mask_dims_op = assert_util.assert_greater_equal(\n tf.rank(x),\n tf.rank(mask),\n message=(\"mask cannot have higher rank than x!\"))\n elif tensorshape_util.rank(mask.shape) > tensorshape_util.rank(x.shape):\n raise ValueError(\n \"mask cannot have higher rank than x! ({} vs {})\".format(\n tensorshape_util.rank(mask.shape),\n tensorshape_util.rank(x.shape)))\n check_mask_shape_op = _check_equal_shape(\n \"mask\", mask.shape[-1:], tf.shape(input=mask)[-1:],\n self.event_shape[-2:-1], self.event_shape_tensor()[-2:-1])\n if self.validate_args:\n runtime_assertions = self.runtime_assertions\n if check_x_shape_op is not None:\n runtime_assertions += [check_x_shape_op]\n if check_mask_shape_op is not None:\n runtime_assertions += [check_mask_shape_op]\n if check_mask_dims_op is not None:\n runtime_assertions += [check_mask_dims_op]\n with tf.control_dependencies(runtime_assertions):\n x = tf.identity(x)\n\n # Get the full output sample_shape + batch shape. Usually\n # this will just be x[:-2], i.e. the input shape excluding\n # event shape. 
But users can specify inputs that broadcast\n # batch dimensions, so we need to broadcast this against\n # self.batch_shape.\n if tensorshape_util.is_fully_defined(\n self.batch_shape) and tensorshape_util.is_fully_defined(x.shape):\n sample_and_batch_shape = tf.broadcast_static_shape(\n x.shape[:-2], self.batch_shape)\n else:\n sample_and_batch_shape = tf.broadcast_dynamic_shape(\n tf.shape(input=x)[:-2], self.batch_shape_tensor())\n\n # Get the full output shape for covariances. The posterior variances\n # in a LGSSM depend only on the model params (batch shape) and on the\n # missingness pattern (mask shape), so in general this may be smaller\n # than the full `sample_and_batch_shape`.\n if mask is None:\n mask_sample_and_batch_shape = self.batch_shape_tensor()\n else:\n if (tensorshape_util.is_fully_defined(self.batch_shape) and\n tensorshape_util.is_fully_defined(mask.shape)):\n mask_sample_and_batch_shape = tf.broadcast_static_shape(\n mask.shape[:-1], self.batch_shape)\n else:\n mask_sample_and_batch_shape = tf.broadcast_dynamic_shape(\n tf.shape(input=mask)[:-1], self.batch_shape_tensor())\n\n # To scan over timesteps we need to move `num_timesteps` from the\n # event shape to the initial dimension of the tensor.\n x = distribution_util.move_dimension(x, -2, 0)\n if mask is not None:\n mask = distribution_util.move_dimension(mask, -1, 0)\n\n # Observations are assumed to be vectors, but we add a dummy\n # extra dimension to allow us to use `matmul` throughout.\n x = x[..., tf.newaxis]\n if mask is not None:\n # Align mask.shape with x.shape, including a unit dimension to broadcast\n # against `observation_size`.\n mask = mask[..., tf.newaxis, tf.newaxis]\n\n # Initialize filtering distribution from the prior. The mean in\n # a Kalman filter depends on data, so should match the full\n # sample and batch shape. 
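The filter assembled here ultimately iterates a standard predict/update recursion over timesteps. A minimal numpy sketch of one such step (illustrative only; it follows the textbook Kalman equations rather than the exact `build_kalman_filter_step` internals):

```python
import numpy as np

def kalman_step(m, P, y, F, Q, H, R):
    """One predict/update cycle of a linear-Gaussian Kalman filter."""
    # Predict: push the previous filtered distribution through the dynamics.
    m_pred = F @ m
    P_pred = F @ P @ F.T + Q
    # Update: condition the prediction on the new observation y.
    S = H @ P_pred @ H.T + R              # predictive observation covariance
    K = P_pred @ H.T @ np.linalg.inv(S)   # Kalman gain
    m_filt = m_pred + K @ (y - H @ m_pred)
    P_filt = (np.eye(m.shape[0]) - K @ H) @ P_pred
    return m_filt, P_filt, m_pred, P_pred

# Scalar random walk observed under unit noise: one update halves the variance.
eye = np.eye(1)
m_f, P_f, _, _ = kalman_step(np.zeros((1, 1)), eye, np.ones((1, 1)),
                             eye, 0.0 * eye, eye, eye)
```

Note that, as the docstring above emphasizes, `P_filt` never touches `y`: the covariances depend only on the model and the missingness pattern.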
The covariance is data-independent, so\n # only has batch shape.\n prior_mean = _broadcast_to_shape(\n self.initial_state_prior.mean()[..., tf.newaxis],\n tf.concat([sample_and_batch_shape,\n [self.latent_size, 1]], axis=0))\n prior_cov = _broadcast_to_shape(\n self.initial_state_prior.covariance(),\n tf.concat([mask_sample_and_batch_shape,\n [self.latent_size, self.latent_size]], axis=0))\n\n initial_observation_matrix = (\n self.get_observation_matrix_for_timestep(self.initial_step))\n initial_observation_noise = (\n self.get_observation_noise_for_timestep(self.initial_step))\n\n initial_observation_mean = _propagate_mean(prior_mean,\n initial_observation_matrix,\n initial_observation_noise)\n initial_observation_cov = _propagate_cov(prior_cov,\n initial_observation_matrix,\n initial_observation_noise)\n\n initial_state = KalmanFilterState(\n predicted_mean=prior_mean,\n predicted_cov=prior_cov,\n filtered_mean=prior_mean, # establishes shape, value ignored\n filtered_cov=prior_cov, # establishes shape, value ignored\n observation_mean=initial_observation_mean,\n observation_cov=initial_observation_cov,\n log_marginal_likelihood=tf.zeros(\n shape=sample_and_batch_shape, dtype=self.dtype),\n timestep=tf.convert_to_tensor(\n value=self.initial_step, dtype=tf.int32, name=\"initial_step\"))\n\n update_step_fn = build_kalman_filter_step(\n self.get_transition_matrix_for_timestep,\n self.get_transition_noise_for_timestep,\n self.get_observation_matrix_for_timestep,\n self.get_observation_noise_for_timestep)\n\n filter_states = tf.scan(update_step_fn,\n elems=x if mask is None else (x, mask),\n initializer=initial_state)\n\n log_likelihoods = distribution_util.move_dimension(\n filter_states.log_marginal_likelihood, 0, -1)\n\n # Move the time dimension back into the event shape.\n filtered_means = distribution_util.move_dimension(\n filter_states.filtered_mean[..., 0], 0, -2)\n filtered_covs = distribution_util.move_dimension(\n filter_states.filtered_cov, 0, -3)\n 
predicted_means = distribution_util.move_dimension(\n filter_states.predicted_mean[..., 0], 0, -2)\n predicted_covs = distribution_util.move_dimension(\n filter_states.predicted_cov, 0, -3)\n observation_means = distribution_util.move_dimension(\n filter_states.observation_mean[..., 0], 0, -2)\n observation_covs = distribution_util.move_dimension(\n filter_states.observation_cov, 0, -3)\n # We could directly construct the batch Distributions\n # filtered_marginals = tfd.MultivariateNormalFullCovariance(\n # filtered_means, filtered_covs)\n # predicted_marginals = tfd.MultivariateNormalFullCovariance(\n # predicted_means, predicted_covs)\n # but we choose not to: returning the raw means and covariances\n # saves computation in Eager mode (avoiding an immediate\n # Cholesky factorization that the user may not want) and aids\n # debugging of numerical issues.\n\n return (log_likelihoods,\n filtered_means, filtered_covs,\n predicted_means, predicted_covs,\n observation_means, observation_covs)"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef posterior_marginals(self, x, mask=None):\n\n with tf.name_scope(\"smooth\"):\n x = tf.convert_to_tensor(value=x, name=\"x\")\n (_, filtered_means, filtered_covs,\n predicted_means, predicted_covs, _, _) = self.forward_filter(\n x, mask=mask)\n\n (smoothed_means, smoothed_covs) = self.backward_smoothing_pass(\n filtered_means, filtered_covs,\n predicted_means, predicted_covs)\n\n return (smoothed_means, smoothed_covs)", "response": "Run a Kalman smoother to return the posterior mean and covariance of the time series x."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _joint_mean(self):\n\n with tf.name_scope(\"mean_joint\"):\n\n # The initial timestep is a special case, since we sample the\n # latent state from the prior rather than the transition model.\n\n with 
tf.control_dependencies(self.runtime_assertions):\n # Broadcast to ensure we represent the full batch shape.\n initial_latent_mean = _broadcast_to_shape(\n self.initial_state_prior.mean()[..., tf.newaxis],\n tf.concat([self.batch_shape_tensor(),\n [self.latent_size, 1]], axis=0))\n\n initial_observation_mean = _propagate_mean(\n initial_latent_mean,\n self.get_observation_matrix_for_timestep(self.initial_step),\n self.get_observation_noise_for_timestep(self.initial_step))\n\n mean_step = build_kalman_mean_step(\n self.get_transition_matrix_for_timestep,\n self.get_transition_noise_for_timestep,\n self.get_observation_matrix_for_timestep,\n self.get_observation_noise_for_timestep)\n\n # Scan over all timesteps following the initial step.\n (latent_means, observation_means) = tf.scan(\n mean_step,\n elems=tf.range(self.initial_step+1, self.final_step),\n initializer=(initial_latent_mean, initial_observation_mean))\n\n # Squish the initial step back on top of the other (scanned) timesteps\n latent_means = tf.concat([initial_latent_mean[tf.newaxis, ...],\n latent_means], axis=0)\n observation_means = tf.concat([initial_observation_mean[tf.newaxis, ...],\n observation_means], axis=0)\n\n # Put dimensions back in order. 
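The dimension shuffling described in these comments, squeezing the dummy `matmul` dimension and moving time from the scan axis back toward the event shape, can be mimicked with numpy's `moveaxis` (a hedged stand-in for `distribution_util.move_dimension`; the shapes are invented for illustration):

```python
import numpy as np

# After the scan, means have shape [num_timesteps, batch, size, 1]; we want
# [batch, num_timesteps, size] back, matching the documented return shapes.
num_timesteps, batch, size = 5, 3, 2
means = np.zeros((num_timesteps, batch, size, 1))

means = np.squeeze(means, axis=-1)   # drop the dummy matmul dimension
means = np.moveaxis(means, 0, -2)    # move time next to the event dimension
assert means.shape == (3, 5, 2)
```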
The samples we've computed have\n # shape `[num_timesteps, batch_shape, size, 1]`, where `size`\n # is the dimension of the latent or observation spaces\n # respectively, but we want to return values with shape\n # `[batch_shape, num_timesteps, size]`.\n latent_means = tf.squeeze(latent_means, -1)\n latent_means = distribution_util.move_dimension(latent_means, 0, -2)\n observation_means = tf.squeeze(observation_means, -1)\n observation_means = distribution_util.move_dimension(\n observation_means, 0, -2)\n\n return latent_means, observation_means", "response": "Compute prior means for all variables via dynamic programming."} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\ncompute prior covariance matrices for all variables via dynamic programming.", "response": "def _joint_covariances(self):\n \"\"\"Compute prior covariances for all variables via dynamic programming.\n\n Returns:\n latent_covs: Prior covariance matrices of latent states `z_t`, as\n a `Tensor` of shape `batch_shape + [num_timesteps,\n latent_size, latent_size]`\n observation_covs: Prior covariance matrices of observations\n `x_t`, as a `Tensor` of shape `batch_shape + [num_timesteps,\n observation_size, observation_size]`\n \"\"\"\n\n with tf.name_scope(\"covariance_joint\"):\n\n with tf.control_dependencies(self.runtime_assertions):\n initial_latent_cov = _broadcast_to_shape(\n self.initial_state_prior.covariance(),\n tf.concat([self.batch_shape_tensor(),\n [self.latent_size, self.latent_size]], axis=0))\n\n initial_observation_cov = _propagate_cov(\n initial_latent_cov,\n self.get_observation_matrix_for_timestep(self.initial_step),\n self.get_observation_noise_for_timestep(self.initial_step))\n\n cov_step = build_kalman_cov_step(\n self.get_transition_matrix_for_timestep,\n self.get_transition_noise_for_timestep,\n self.get_observation_matrix_for_timestep,\n self.get_observation_noise_for_timestep)\n\n # Scan over all timesteps following the initial 
step.\n (latent_covs, observation_covs) = tf.scan(\n cov_step,\n elems=tf.range(self.initial_step+1, self.final_step),\n initializer=(initial_latent_cov, initial_observation_cov))\n\n # Squish the initial step back on top of the other (scanned) timesteps\n latent_covs = tf.concat([initial_latent_cov[tf.newaxis, ...],\n latent_covs], axis=0)\n observation_covs = tf.concat([initial_observation_cov[tf.newaxis, ...],\n observation_covs], axis=0)\n\n # Put dimensions back in order. The samples we've computed have\n # shape `[num_timesteps, batch_shape, size, size]`, where `size`\n # is the dimension of the state or observation spaces\n # respectively, but we want to return values with shape\n # `[batch_shape, num_timesteps, size, size]`.\n latent_covs = distribution_util.move_dimension(latent_covs, 0, -3)\n observation_covs = distribution_util.move_dimension(\n observation_covs, 0, -3)\n return latent_covs, observation_covs"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef latents_to_observations(self, latent_means, latent_covs):\n\n with tf.name_scope(\"latents_to_observations\"):\n\n pushforward_latents_step = build_pushforward_latents_step(\n self.get_observation_matrix_for_timestep,\n self.get_observation_noise_for_timestep)\n\n latent_means = distribution_util.move_dimension(\n latent_means, source_idx=-2, dest_idx=0)\n latent_means = latent_means[..., tf.newaxis] # Make matmul happy.\n latent_covs = distribution_util.move_dimension(\n latent_covs, source_idx=-3, dest_idx=0)\n\n (initial_observation_mean,\n initial_observation_cov) = pushforward_latents_step(\n _=None, # Loop body ignores previous observations.\n latent_t_mean_cov=(self.initial_step,\n latent_means[self.initial_step],\n latent_covs[self.initial_step]))\n\n # TODO(davmre) this loop is embarrassingly parallel; replace with `pfor`.\n timesteps = tf.range(self.initial_step,\n self.initial_step + self.num_timesteps)\n observation_means, 
observation_covs = tf.scan(\n pushforward_latents_step,\n elems=(timesteps, latent_means, latent_covs),\n initializer=(initial_observation_mean, initial_observation_cov),\n parallel_iterations=10000)\n\n observation_means = distribution_util.move_dimension(\n observation_means[..., 0], source_idx=0, dest_idx=-2)\n observation_covs = distribution_util.move_dimension(\n observation_covs, source_idx=0, dest_idx=-3)\n\n return observation_means, observation_covs", "response": "Push latent means and covariance forward through the observation model."} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\ncomputing I_v z using a recurrence relation where z > 0.", "response": "def _bessel_ive(v, z, cache=None):\n \"\"\"Computes I_v(z)*exp(-abs(z)) using a recurrence relation, where z > 0.\"\"\"\n # TODO(b/67497980): Switch to a more numerically faithful implementation.\n z = tf.convert_to_tensor(value=z)\n\n wrap = lambda result: tf.debugging.check_numerics(result, 'besseli{}'.format(v\n ))\n\n if float(v) >= 2:\n raise ValueError(\n 'Evaluating bessel_i by recurrence becomes imprecise for large v')\n\n cache = cache or {}\n safe_z = tf.where(z > 0, z, tf.ones_like(z))\n if v in cache:\n return wrap(cache[v])\n if v == 0:\n cache[v] = tf.math.bessel_i0e(z)\n elif v == 1:\n cache[v] = tf.math.bessel_i1e(z)\n elif v == 0.5:\n # sinh(x)*exp(-abs(x)), sinh(x) = (e^x - e^{-x}) / 2\n sinhe = lambda x: (tf.exp(x - tf.abs(x)) - tf.exp(-x - tf.abs(x))) / 2\n cache[v] = (\n np.sqrt(2 / np.pi) * sinhe(z) *\n tf.where(z > 0, tf.math.rsqrt(safe_z), tf.ones_like(safe_z)))\n elif v == -0.5:\n # cosh(x)*exp(-abs(x)), cosh(x) = (e^x + e^{-x}) / 2\n coshe = lambda x: (tf.exp(x - tf.abs(x)) + tf.exp(-x - tf.abs(x))) / 2\n cache[v] = (\n np.sqrt(2 / np.pi) * coshe(z) *\n tf.where(z > 0, tf.math.rsqrt(safe_z), tf.ones_like(safe_z)))\n if v <= 1:\n return wrap(cache[v])\n # Recurrence relation:\n cache[v] = (_bessel_ive(v - 2, z, cache) -\n (2 * (v - 1)) * _bessel_ive(v 
- 1, z, cache) / z)\n return wrap(cache[v])"} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\ncompute the log - normalizer of the distribution.", "response": "def _log_normalization(self):\n \"\"\"Computes the log-normalizer of the distribution.\"\"\"\n event_dim = tf.compat.dimension_value(self.event_shape[0])\n if event_dim is None:\n raise ValueError('vMF _log_normalizer currently only supports '\n 'statically known event shape')\n safe_conc = tf.where(self.concentration > 0,\n self.concentration,\n tf.ones_like(self.concentration))\n safe_lognorm = ((event_dim / 2 - 1) * tf.math.log(safe_conc) -\n (event_dim / 2) * np.log(2 * np.pi) -\n tf.math.log(_bessel_ive(event_dim / 2 - 1, safe_conc)) -\n tf.abs(safe_conc))\n log_nsphere_surface_area = (\n np.log(2.) + (event_dim / 2) * np.log(np.pi) -\n tf.math.lgamma(tf.cast(event_dim / 2, self.dtype)))\n return tf.where(self.concentration > 0,\n -safe_lognorm,\n log_nsphere_surface_area * tf.ones_like(safe_lognorm))"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _maybe_assert_valid_sample(self, samples):\n if not self.validate_args:\n return samples\n with tf.control_dependencies([\n assert_util.assert_near(\n 1.,\n tf.linalg.norm(tensor=samples, axis=-1),\n message='samples must be unit length'),\n assert_util.assert_equal(\n tf.shape(input=samples)[-1:],\n self.event_shape_tensor(),\n message=('samples must have innermost dimension matching that of '\n '`self.mean_direction`')),\n ]):\n return tf.identity(samples)", "response": "Check counts for proper shape values then return tensor version."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef _mode(self):\n return (self.mean_direction +\n tf.zeros_like(self.concentration)[..., tf.newaxis])", "response": "The mode of the von Mises - Fisher distribution."} {"SOURCE": "codesearchnet", "instruction": "Make a summary of 
the following Python 3 code\ndef _rotate(self, samples):\n event_dim = (\n tf.compat.dimension_value(self.event_shape[0]) or\n self._event_shape_tensor()[0])\n basis = tf.concat([[1.], tf.zeros([event_dim - 1], dtype=self.dtype)],\n axis=0)\n u = tf.nn.l2_normalize(basis - self.mean_direction, axis=-1)\n return samples - 2 * tf.reduce_sum(\n input_tensor=samples * u, axis=-1, keepdims=True) * u", "response": "Applies a Householder rotation to samples."} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\nspecializes inversion sampler for 3D.", "response": "def _sample_3d(self, n, seed=None):\n \"\"\"Specialized inversion sampler for 3D.\"\"\"\n seed = seed_stream.SeedStream(seed, salt='von_mises_fisher_3d')\n u_shape = tf.concat([[n], self._batch_shape_tensor()], axis=0)\n z = tf.random.uniform(u_shape, seed=seed(), dtype=self.dtype)\n # TODO(bjp): Higher-order odd dim analytic CDFs are available in [1], could\n # be bisected for bounded sampling runtime (i.e. 
not rejection sampling).\n # [1]: Inversion sampler via: https://ieeexplore.ieee.org/document/7347705/\n # The inversion is: u = 1 + log(z + (1-z)*exp(-2*kappa)) / kappa\n # We must protect against both kappa and z being zero.\n safe_conc = tf.where(self.concentration > 0,\n self.concentration,\n tf.ones_like(self.concentration))\n safe_z = tf.where(z > 0, z, tf.ones_like(z))\n safe_u = 1 + tf.reduce_logsumexp(\n input_tensor=[\n tf.math.log(safe_z),\n tf.math.log1p(-safe_z) - 2 * safe_conc\n ],\n axis=0) / safe_conc\n # Limit of the above expression as kappa->0 is 2*z-1\n u = tf.where(self.concentration > tf.zeros_like(safe_u), safe_u,\n 2 * z - 1)\n # Limit of the expression as z->0 is -1.\n u = tf.where(tf.equal(z, 0), -tf.ones_like(u), u)\n if not self._allow_nan_stats:\n u = tf.debugging.check_numerics(u, 'u in _sample_3d')\n return u[..., tf.newaxis]"} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\ncreate a deep copy of a function.", "response": "def _copy_fn(fn):\n \"\"\"Create a deep copy of fn.\n\n Args:\n fn: a callable\n\n Returns:\n A `FunctionType`: a deep copy of fn.\n\n Raises:\n TypeError: if `fn` is not a callable.\n \"\"\"\n if not callable(fn):\n raise TypeError(\"fn is not callable: {}\".format(fn))\n # The blessed way to copy a function. copy.deepcopy fails to create a\n # non-reference copy. 
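As the `_copy_fn` docstring notes, `copy.deepcopy` fails to produce an independent function object, while rebuilding via `types.FunctionType` does. A minimal usage sketch of that recipe (the `square` example function is hypothetical):

```python
import types

def copy_fn(fn):
    # Rebuild the function from its code object, globals, defaults and
    # closure -- the same recipe the surrounding `_copy_fn` uses.
    return types.FunctionType(
        fn.__code__, fn.__globals__, name=fn.__name__,
        argdefs=fn.__defaults__, closure=fn.__closure__)

def square(x, power=2):   # hypothetical example function
    return x ** power

clone = copy_fn(square)
clone.__doc__ = "an independent copy"   # leaves square.__doc__ untouched
assert clone(3) == square(3) == 9
assert clone is not square
```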
Since:\n # types.FunctionType == type(lambda: None),\n # and the docstring for the function type states:\n #\n # function(code, globals[, name[, argdefs[, closure]]])\n #\n # Create a function object from a code object and a dictionary.\n # ...\n #\n # Here we can use this to create a new function with the old function's\n # code, globals, closure, etc.\n return types.FunctionType(\n code=fn.__code__, globals=fn.__globals__,\n name=fn.__name__, argdefs=fn.__defaults__,\n closure=fn.__closure__)"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef _update_docstring(old_str, append_str):\n old_str = old_str or \"\"\n old_str_lines = old_str.split(\"\\n\")\n\n # Step 0: Prepend spaces to all lines of append_str. This is\n # necessary for correct markdown generation.\n append_str = \"\\n\".join(\" %s\" % line for line in append_str.split(\"\\n\"))\n\n # Step 1: Find mention of \"Args\":\n has_args_ix = [\n ix for ix, line in enumerate(old_str_lines)\n if line.strip().lower() == \"args:\"]\n if has_args_ix:\n final_args_ix = has_args_ix[-1]\n return (\"\\n\".join(old_str_lines[:final_args_ix])\n + \"\\n\\n\" + append_str + \"\\n\\n\"\n + \"\\n\".join(old_str_lines[final_args_ix:]))\n else:\n return old_str + \"\\n\\n\" + append_str", "response": "Update old_str by inserting append_str just before the args section."} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef _convert_to_tensor(value, dtype=None, dtype_hint=None, name=None):\n if (tf.nest.is_nested(dtype) or\n tf.nest.is_nested(dtype_hint)):\n if dtype is None:\n fn = lambda v, pd: tf.convert_to_tensor(v, dtype_hint=pd, name=name)\n return tf.nest.map_structure(fn, value, dtype_hint)\n elif dtype_hint is None:\n fn = lambda v, d: tf.convert_to_tensor(v, dtype=d, name=name)\n return tf.nest.map_structure(fn, value, dtype_hint)\n else:\n fn = lambda v, d, pd: tf.convert_to_tensor( # pylint: 
disable=g-long-lambda\n v, dtype=d, dtype_hint=pd, name=name)\n return tf.nest.map_structure(fn, value, dtype, dtype_hint)\n return tf.convert_to_tensor(\n value=value, dtype=dtype, dtype_hint=dtype_hint, name=name)", "response": "Converts the given value to a tensor."} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\nremoves dict keys which have self as value.", "response": "def _remove_dict_keys_with_value(dict_, val):\n \"\"\"Removes `dict` keys which have `self` as value.\"\"\"\n return {k: v for k, v in dict_.items() if v is not val}"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _recursively_replace_dict_for_pretty_dict(x):\n # We use \"PrettyDict\" because collections.OrderedDict repr/str has the word\n # \"OrderedDict\" in it. We only want to print \"OrderedDict\" if in fact the\n # input really is an OrderedDict.\n if isinstance(x, dict):\n return _PrettyDict({\n k: _recursively_replace_dict_for_pretty_dict(v)\n for k, v in x.items()})\n if (isinstance(x, collections.Sequence) and\n not isinstance(x, six.string_types)):\n args = (_recursively_replace_dict_for_pretty_dict(x_) for x_ in x)\n is_named_tuple = (isinstance(x, tuple) and\n hasattr(x, \"_asdict\") and\n hasattr(x, \"_fields\"))\n return type(x)(*args) if is_named_tuple else type(x)(args)\n if isinstance(x, collections.Mapping):\n return type(x)(**{k: _recursively_replace_dict_for_pretty_dict(v)\n for k, v in x.items()})\n return x", "response": "Recursively replace dict s with _PrettyDict."} {"SOURCE": "codesearchnet", "instruction": "Can you create a Python 3 function that\ncomputes the Monte-Carlo approximation of `E_p[f(X)]`. This function computes the Monte-Carlo approximation of an expectation, i.e., ```none E_p[f(X)] approx= m**-1 sum_i^m f(x_j), x_j ~iid p(X) ``` where: - `x_j = samples[j, ...]`, - `log(p(samples)) = log_prob(samples)` and - `m = prod(shape(samples)[axis])`. 
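The sample-average formula just stated is easy to check numerically. A hedged numpy sketch estimating the second moment `E[X**2]` of a standard normal (sample size, seed, and tolerance are arbitrary choices):

```python
import numpy as np

rng = np.random.default_rng(42)
samples = rng.standard_normal(100_000)

# Monte-Carlo estimate of E_p[f(X)] with f(x) = x**2 and p = Normal(0, 1);
# the exact second moment is 1.0.
approx = np.mean(samples ** 2)
```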
Tricks: Reparameterization and Score-Gradient When p is \"reparameterized\", i.e., a diffeomorphic transformation of a parameterless distribution (e.g., `Normal(Y; m, s) <=> Y = sX + m, X ~ Normal(0,1)`), we can swap gradient and expectation, i.e., `grad[ Avg{ s_i : i=1...n } ] = Avg{ grad[s_i] : i=1...n }` where `S_n = Avg{s_i}` and `s_i = f(x_i), x_i ~ p`. However, if p is not reparameterized, TensorFlow's gradient will be incorrect since the chain-rule stops at samples of non-reparameterized distributions. (The non-differentiated result, `approx_expectation`, is the same regardless of `use_reparametrization`.) In this circumstance using the Score-Gradient trick results in an unbiased gradient, i.e., ```none grad[ E_p[f(X)] ] = grad[ int dx p(x) f(x) ] = int dx grad[ p(x) f(x) ] = int dx [ p'(x) f(x) + p(x) f'(x) ] = int dx p(x) [p'(x) / p(x) f(x) + f'(x) ] = int dx p(x) grad[ f(x) p(x) / stop_grad[p(x)] ] = E_p[ grad[ f(x) p(x) / stop_grad[p(x)] ] ] ``` Unless p is not reparametrized, it is usually preferable to `use_reparametrization = True`. Warning: users are responsible for verifying `p` is a \"reparameterized\" distribution. Example Use: ```python # Monte-Carlo approximation of a reparameterized distribution, e.g., Normal. num_draws = int(1e5) p = tfp.distributions.Normal(loc=0., scale=1.) q = tfp.distributions.Normal(loc=1., scale=2.) exact_kl_normal_normal = tfp.distributions.kl_divergence(p, q) # ==> 0.44314718 approx_kl_normal_normal = tfp.monte_carlo.expectation( f=lambda x: p.log_prob(x) - q.log_prob(x), samples=p.sample(num_draws, seed=42), log_prob=p.log_prob, use_reparametrization=(p.reparameterization_type == tfp.distributions.FULLY_REPARAMETERIZED)) # ==> 0.44632751 # Relative Error: <1% # Monte-Carlo approximation of non-reparameterized distribution, # e.g., Bernoulli. 
num_draws = int(1e5) p = tfp.distributions.Bernoulli(probs=0.4) q = tfp.distributions.Bernoulli(probs=0.8) exact_kl_bernoulli_bernoulli = tfp.distributions.kl_divergence(p, q) # ==> 0.38190854 approx_kl_bernoulli_bernoulli = tfp.monte_carlo.expectation( f=lambda x: p.log_prob(x) - q.log_prob(x), samples=p.sample(num_draws, seed=42), log_prob=p.log_prob, use_reparametrization=(p.reparameterization_type == tfp.distributions.FULLY_REPARAMETERIZED)) # ==> 0.38336259 # Relative Error: <1% # For comparing the gradients, see `expectation_test.py`. ``` Note: The above example is for illustration only. To compute approximate KL-divergence, the following is preferred: ```python approx_kl_p_q = bf.monte_carlo_csiszar_f_divergence( f=bf.kl_reverse, p_log_prob=q.log_prob, q=p, num_draws=num_draws) ``` Args: f: Python callable which can return `f(samples)`. samples: `Tensor` of samples used to form the Monte-Carlo approximation of `E_p[f(X)]`. A batch of samples should be indexed by `axis` dimensions. log_prob: Python callable which can return `log_prob(samples)`. Must correspond to the natural-logarithm of the pdf/pmf of each sample. Only required/used if `use_reparametrization=False`. Default value: `None`. use_reparametrization: Python `bool` indicating that the approximation should use the fact that the gradient of samples is unbiased. Whether `True` or `False`, this arg only affects the gradient of the resulting `approx_expectation`. Default value: `True`. axis: The dimensions to average. If `None`, averages all dimensions. Default value: `0` (the left-most dimension). keep_dims: If True, retains averaged dimensions using size `1`. Default value: `False`. name: A `name_scope` for operations created by this function. Default value: `None` (which implies \"expectation\"). Returns: approx_expectation: `Tensor` corresponding to the Monte-Carlo approximation of `E_p[f(X)]`. Raises: ValueError: if `f` is not a Python `callable`. 
ValueError: if `use_reparametrization=False` and `log_prob` is not a Python `callable`.", "response": "def expectation(f, samples, log_prob=None, use_reparametrization=True,\n axis=0, keep_dims=False, name=None):\n \"\"\"Computes the Monte-Carlo approximation of `E_p[f(X)]`.\n\n This function computes the Monte-Carlo approximation of an expectation, i.e.,\n\n ```none\n E_p[f(X)] approx= m**-1 sum_i^m f(x_j), x_j ~iid p(X)\n ```\n\n where:\n\n - `x_j = samples[j, ...]`,\n - `log(p(samples)) = log_prob(samples)` and\n - `m = prod(shape(samples)[axis])`.\n\n Tricks: Reparameterization and Score-Gradient\n\n When p is \"reparameterized\", i.e., a diffeomorphic transformation of a\n parameterless distribution (e.g.,\n `Normal(Y; m, s) <=> Y = sX + m, X ~ Normal(0,1)`), we can swap gradient and\n expectation, i.e.,\n `grad[ Avg{ s_i : i=1...n } ] = Avg{ grad[s_i] : i=1...n }` where\n `S_n = Avg{s_i}` and `s_i = f(x_i), x_i ~ p`.\n\n However, if p is not reparameterized, TensorFlow's gradient will be incorrect\n since the chain-rule stops at samples of non-reparameterized distributions.\n (The non-differentiated result, `approx_expectation`, is the same regardless\n of `use_reparametrization`.) 
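For the non-reparameterized case, the docstring's score-function (REINFORCE) estimator can be checked numerically without TensorFlow. A minimal numpy sketch for the mean of a Bernoulli (all constants are illustrative; the true gradient of `E[X] = theta` is exactly 1):

```python
import numpy as np

rng = np.random.default_rng(0)
theta = 0.4
x = (rng.random(200_000) < theta).astype(np.float64)  # Bernoulli(theta) draws

# Score-function estimator of d/dtheta E[f(X)], X ~ Bernoulli(theta), f(x) = x:
#   grad log p(x; theta) = x/theta - (1 - x)/(1 - theta)
score = x / theta - (1.0 - x) / (1.0 - theta)
grad_estimate = np.mean(x * score)
```

A reparameterized estimator would instead differentiate through the samples themselves, which is exactly what is impossible for a discrete distribution like the Bernoulli.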
In this circumstance using the Score-Gradient\n trick results in an unbiased gradient, i.e.,\n\n ```none\n grad[ E_p[f(X)] ]\n = grad[ int dx p(x) f(x) ]\n = int dx grad[ p(x) f(x) ]\n = int dx [ p'(x) f(x) + p(x) f'(x) ]\n = int dx p(x) [p'(x) / p(x) f(x) + f'(x) ]\n = int dx p(x) grad[ f(x) p(x) / stop_grad[p(x)] ]\n = E_p[ grad[ f(x) p(x) / stop_grad[p(x)] ] ]\n ```\n\n Unless p is not reparametrized, it is usually preferable to\n `use_reparametrization = True`.\n\n Warning: users are responsible for verifying `p` is a \"reparameterized\"\n distribution.\n\n Example Use:\n\n ```python\n # Monte-Carlo approximation of a reparameterized distribution, e.g., Normal.\n\n num_draws = int(1e5)\n p = tfp.distributions.Normal(loc=0., scale=1.)\n q = tfp.distributions.Normal(loc=1., scale=2.)\n exact_kl_normal_normal = tfp.distributions.kl_divergence(p, q)\n # ==> 0.44314718\n approx_kl_normal_normal = tfp.monte_carlo.expectation(\n f=lambda x: p.log_prob(x) - q.log_prob(x),\n samples=p.sample(num_draws, seed=42),\n log_prob=p.log_prob,\n use_reparametrization=(p.reparameterization_type\n == tfp.distributions.FULLY_REPARAMETERIZED))\n # ==> 0.44632751\n # Relative Error: <1%\n\n # Monte-Carlo approximation of non-reparameterized distribution,\n # e.g., Bernoulli.\n\n num_draws = int(1e5)\n p = tfp.distributions.Bernoulli(probs=0.4)\n q = tfp.distributions.Bernoulli(probs=0.8)\n exact_kl_bernoulli_bernoulli = tfp.distributions.kl_divergence(p, q)\n # ==> 0.38190854\n approx_kl_bernoulli_bernoulli = tfp.monte_carlo.expectation(\n f=lambda x: p.log_prob(x) - q.log_prob(x),\n samples=p.sample(num_draws, seed=42),\n log_prob=p.log_prob,\n use_reparametrization=(p.reparameterization_type\n == tfp.distributions.FULLY_REPARAMETERIZED))\n # ==> 0.38336259\n # Relative Error: <1%\n\n # For comparing the gradients, see `expectation_test.py`.\n ```\n\n Note: The above example is for illustration only. 
To compute approximate\n KL-divergence, the following is preferred:\n\n ```python\n approx_kl_p_q = bf.monte_carlo_csiszar_f_divergence(\n f=bf.kl_reverse,\n p_log_prob=q.log_prob,\n q=p,\n num_draws=num_draws)\n ```\n\n Args:\n f: Python callable which can return `f(samples)`.\n samples: `Tensor` of samples used to form the Monte-Carlo approximation of\n `E_p[f(X)]`. A batch of samples should be indexed by `axis` dimensions.\n log_prob: Python callable which can return `log_prob(samples)`. Must\n correspond to the natural-logarithm of the pdf/pmf of each sample. Only\n required/used if `use_reparametrization=False`.\n Default value: `None`.\n use_reparametrization: Python `bool` indicating that the approximation\n should use the fact that the gradient of samples is unbiased. Whether\n `True` or `False`, this arg only affects the gradient of the resulting\n `approx_expectation`.\n Default value: `True`.\n axis: The dimensions to average. If `None`, averages all\n dimensions.\n Default value: `0` (the left-most dimension).\n keep_dims: If True, retains averaged dimensions using size `1`.\n Default value: `False`.\n name: A `name_scope` for operations created by this function.\n Default value: `None` (which implies \"expectation\").\n\n Returns:\n approx_expectation: `Tensor` corresponding to the Monte-Carlo approximation\n of `E_p[f(X)]`.\n\n Raises:\n ValueError: if `f` is not a Python `callable`.\n ValueError: if `use_reparametrization=False` and `log_prob` is not a Python\n `callable`.\n \"\"\"\n\n with tf.compat.v1.name_scope(name, 'expectation', [samples]):\n if not callable(f):\n raise ValueError('`f` must be a callable function.')\n if use_reparametrization:\n return tf.reduce_mean(\n input_tensor=f(samples), axis=axis, keepdims=keep_dims)\n else:\n if not callable(log_prob):\n raise ValueError('`log_prob` must be a callable function.')\n stop = tf.stop_gradient # For readability.\n x = stop(samples)\n logpx = log_prob(x)\n fx = f(x) # Call `f` once in case 
it has side-effects.\n # To achieve this, we use the fact that:\n # `h(x) - stop(h(x)) == zeros_like(h(x))`\n # but its gradient is grad[h(x)].\n #\n # This technique was published as:\n # Jakob Foerster, Greg Farquhar, Maruan Al-Shedivat, Tim Rocktaeschel,\n # Eric P. Xing, Shimon Whiteson (ICML 2018)\n # \"DiCE: The Infinitely Differentiable Monte-Carlo Estimator\"\n # https://arxiv.org/abs/1802.05098\n #\n # Unlike using:\n # fx = fx + stop(fx) * (logpx - stop(logpx)),\n # DiCE ensures that any order gradients of the objective\n # are unbiased gradient estimators.\n #\n # Note that IEEE754 specifies that `x - x == 0.` and `x + 0. == x`, hence\n # this trick loses no precision. For more discussion regarding the\n # relevant portions of the IEEE754 standard, see the StackOverflow\n # question,\n # \"Is there a floating point value of x, for which x-x == 0 is false?\"\n # http://stackoverflow.com/q/2686644\n dice = fx * tf.exp(logpx - stop(logpx))\n return tf.reduce_mean(input_tensor=dice, axis=axis, keepdims=keep_dims)"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef _get_samples(dist, z, n, seed):\n with tf.compat.v1.name_scope('get_samples', values=[z, n]):\n if (n is None) == (z is None):\n raise ValueError(\n 'Must specify exactly one of arguments \"n\" and \"z\". 
Found: '\n 'n = %s, z = %s' % (n, z))\n if n is not None:\n return dist.sample(n, seed=seed)\n else:\n return tf.convert_to_tensor(value=z, name='z')", "response": "Check args and return samples."} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef _choose_base_case(is_accepted,\n accepted,\n rejected,\n name=None):\n \"\"\"Helper to `choose` which expand_dims `is_accepted` and applies tf.where.\"\"\"\n def _expand_is_accepted_like(x):\n \"\"\"Helper to expand `is_accepted` like the shape of some input arg.\"\"\"\n with tf.compat.v1.name_scope('expand_is_accepted_like'):\n expand_shape = tf.concat([\n tf.shape(input=is_accepted),\n tf.ones([tf.rank(x) - tf.rank(is_accepted)], dtype=tf.int32),\n ],\n axis=0)\n multiples = tf.concat([\n tf.ones([tf.rank(is_accepted)], dtype=tf.int32),\n tf.shape(input=x)[tf.rank(is_accepted):],\n ],\n axis=0)\n m = tf.tile(tf.reshape(is_accepted, expand_shape),\n multiples)\n m.set_shape(m.shape.merge_with(x.shape))\n return m\n def _where(accepted, rejected):\n if accepted is rejected:\n return accepted\n accepted = tf.convert_to_tensor(value=accepted, name='accepted')\n rejected = tf.convert_to_tensor(value=rejected, name='rejected')\n r = tf.where(_expand_is_accepted_like(accepted), accepted, rejected)\n r.set_shape(r.shape.merge_with(accepted.shape.merge_with(rejected.shape)))\n return r\n\n with tf.compat.v1.name_scope(\n name, 'choose', values=[is_accepted, accepted, rejected]):\n if not is_list_like(accepted):\n return _where(accepted, rejected)\n return [(choose(is_accepted, a, r, name=name) if is_namedtuple_like(a)\n else _where(a, r))\n for a, r in zip(accepted, rejected)]", "response": "Helper function to choose base case."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef choose(is_accepted, accepted, rejected, name=None):\n if not is_namedtuple_like(accepted):\n return 
_choose_base_case(is_accepted, accepted, rejected, name=name)\n if not isinstance(accepted, type(rejected)):\n raise TypeError('Type of `accepted` ({}) must be identical to '\n 'type of `rejected` ({})'.format(\n type(accepted).__name__,\n type(rejected).__name__))\n return type(accepted)(**dict(\n [(fn,\n choose(is_accepted,\n getattr(accepted, fn),\n getattr(rejected, fn),\n name=name))\n for fn in accepted._fields]))", "response": "Helper which expand_dims is_accepted then applies tf.where."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef safe_sum(x, alt_value=-np.inf, name=None):\n with tf.compat.v1.name_scope(name, 'safe_sum', [x, alt_value]):\n if not is_list_like(x):\n raise TypeError('Expected list input.')\n if not x:\n raise ValueError('Input should not be empty.')\n in_shape = x[0].shape\n x = tf.stack(x, axis=-1)\n x = tf.reduce_sum(input_tensor=x, axis=-1)\n alt_value = np.array(alt_value, x.dtype.as_numpy_dtype)\n alt_fill = tf.fill(tf.shape(input=x), value=alt_value)\n x = tf.where(tf.math.is_finite(x), x, alt_fill)\n x.set_shape(x.shape.merge_with(in_shape))\n return x", "response": "Elementwise adds list members, replacing non-finite results with alt_value."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\ncalling fn and computing the gradient of the result wrt args_list.", "response": "def maybe_call_fn_and_grads(fn,\n fn_arg_list,\n result=None,\n grads=None,\n check_non_none_grads=True,\n name=None):\n \"\"\"Calls `fn` and computes the gradient of the result wrt `args_list`.\"\"\"\n with tf.compat.v1.name_scope(name, 'maybe_call_fn_and_grads',\n [fn_arg_list, result, grads]):\n fn_arg_list = (list(fn_arg_list) if is_list_like(fn_arg_list)\n else [fn_arg_list])\n result, grads = _value_and_gradients(fn, fn_arg_list, result, grads)\n if not all(r.dtype.is_floating\n for r in (result if is_list_like(result) else [result])): # pylint: 
disable=superfluous-parens\n raise TypeError('Function result must be a `Tensor` with `float` '\n '`dtype`.')\n if len(fn_arg_list) != len(grads):\n raise ValueError('Function args must be in one-to-one correspondence '\n 'with grads.')\n if check_non_none_grads and any(g is None for g in grads):\n raise ValueError('Encountered `None` gradient.\\n'\n ' fn_arg_list: {}\\n'\n ' grads: {}'.format(fn_arg_list, grads))\n return result, grads"} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef smart_for_loop(loop_num_iter, body_fn, initial_loop_vars,\n parallel_iterations=10, name=None):\n \"\"\"Construct a for loop, preferring a python loop if `n` is statically known.\n\n Given `loop_num_iter` and `body_fn`, return an op corresponding to executing\n `body_fn` `loop_num_iter` times, feeding previous outputs of `body_fn` into\n the next iteration.\n\n If `loop_num_iter` is statically known, the op is constructed via python for\n loop, and otherwise a `tf.while_loop` is used.\n\n Args:\n loop_num_iter: `Integer` `Tensor` representing the number of loop\n iterations.\n body_fn: Callable to be executed `loop_num_iter` times.\n initial_loop_vars: Listlike object of `Tensors` to be passed in to\n `body_fn`'s first execution.\n parallel_iterations: The number of iterations allowed to run in parallel.\n It must be a positive integer. 
See `tf.while_loop` for more details.\n Default value: `10`.\n name: Python `str` name prefixed to Ops created by this function.\n Default value: `None` (i.e., \"smart_for_loop\").\n Returns:\n result: `Tensor` representing applying `body_fn` iteratively `n` times.\n \"\"\"\n with tf.compat.v1.name_scope(name, 'smart_for_loop',\n [loop_num_iter, initial_loop_vars]):\n loop_num_iter_ = tf.get_static_value(loop_num_iter)\n if (loop_num_iter_ is None or tf.executing_eagerly() or\n control_flow_util.GraphOrParentsInXlaContext(\n tf.compat.v1.get_default_graph())):\n # Cast to int32 to run the comparison against i in host memory,\n # where while/LoopCond needs it.\n loop_num_iter = tf.cast(loop_num_iter, dtype=tf.int32)\n return tf.while_loop(\n cond=lambda i, *args: i < loop_num_iter,\n body=lambda i, *args: [i + 1] + list(body_fn(*args)),\n loop_vars=[np.int32(0)] + initial_loop_vars,\n parallel_iterations=parallel_iterations\n )[1:]\n result = initial_loop_vars\n for _ in range(loop_num_iter_):\n result = body_fn(*result)\n return result", "response": "Constructs a for loop."} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef trace_scan(loop_fn,\n initial_state,\n elems,\n trace_fn,\n parallel_iterations=10,\n name=None):\n \"\"\"A simplified version of `tf.scan` that has configurable tracing.\n\n This function repeatedly calls `loop_fn(state, elem)`, where `state` is the\n `initial_state` during the first iteration, and the return value of `loop_fn`\n for every iteration thereafter. `elem` is a slice of `elements` along the\n first dimension, accessed in order. Additionally, it calls `trace_fn` on the\n return value of `loop_fn`. 
The `Tensor`s in return values of `trace_fn` are\n stacked and returned from this function, such that the first dimension of\n those `Tensor`s matches the size of `elems`.\n\n Args:\n loop_fn: A callable that takes in a `Tensor` or a nested collection of\n `Tensor`s with the same structure as `initial_state`, a slice of `elems`\n and returns the same structure as `initial_state`.\n initial_state: A `Tensor` or a nested collection of `Tensor`s passed to\n `loop_fn` in the first iteration.\n elems: A `Tensor` that is split along the first dimension and each element\n of which is passed to `loop_fn`.\n trace_fn: A callable that takes in the return value of `loop_fn` and returns\n a `Tensor` or a nested collection of `Tensor`s.\n parallel_iterations: Passed to the internal `tf.while_loop`.\n name: Name scope used in this function. Default: 'trace_scan'.\n\n Returns:\n final_state: The final return value of `loop_fn`.\n trace: The same structure as the return value of `trace_fn`, but with each\n `Tensor` being a stack of the corresponding `Tensors` in the return value\n of `trace_fn` for each slice of `elems`.\n \"\"\"\n with tf.compat.v1.name_scope(\n name, 'trace_scan', [initial_state, elems]), tf.compat.v1.variable_scope(\n tf.compat.v1.get_variable_scope()) as vs:\n if vs.caching_device is None and not tf.executing_eagerly():\n vs.set_caching_device(lambda op: op.device)\n\n initial_state = tf.nest.map_structure(\n lambda x: tf.convert_to_tensor(value=x, name='initial_state'),\n initial_state)\n elems = tf.convert_to_tensor(value=elems, name='elems')\n\n static_length = elems.shape[0]\n if tf.compat.dimension_value(static_length) is None:\n length = tf.shape(input=elems)[0]\n else:\n length = tf.convert_to_tensor(\n value=static_length, dtype=tf.int32, name='length')\n\n # This is an TensorArray in part because of XLA, which had trouble with\n # non-statically known indices. I.e. 
elems[i] errored, but\n # elems_array.read(i) worked.\n elems_array = tf.TensorArray(\n elems.dtype, size=length, element_shape=elems.shape[1:])\n elems_array = elems_array.unstack(elems)\n\n trace_arrays = tf.nest.map_structure(\n lambda x: tf.TensorArray(x.dtype, size=length, element_shape=x.shape),\n trace_fn(initial_state))\n\n def _body(i, state, trace_arrays):\n state = loop_fn(state, elems_array.read(i))\n trace_arrays = tf.nest.pack_sequence_as(trace_arrays, [\n a.write(i, v) for a, v in zip(\n tf.nest.flatten(trace_arrays), tf.nest.flatten(trace_fn(state)))\n ])\n return i + 1, state, trace_arrays\n\n _, final_state, trace_arrays = tf.while_loop(\n cond=lambda i, *args: i < length,\n body=_body,\n loop_vars=(0, initial_state, trace_arrays),\n parallel_iterations=parallel_iterations)\n\n stacked_trace = tf.nest.map_structure(lambda x: x.stack(), trace_arrays)\n\n # Restore the static length if we know it.\n def _merge_static_length(x):\n x.set_shape(tf.TensorShape(static_length).concatenate(x.shape[1:]))\n return x\n\n stacked_trace = tf.nest.map_structure(_merge_static_length, stacked_trace)\n return final_state, stacked_trace", "response": "A simplified version of tf. 
scan that has configurable tracing."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function for\nwrapping a setter so it applies to the innermost results in kernel_results.", "response": "def make_innermost_setter(setter):\n \"\"\"Wraps a setter so it applies to the inner-most results in `kernel_results`.\n\n The wrapped setter unwraps `kernel_results` and applies `setter` to the first\n results without an `inner_results` attribute.\n\n Args:\n setter: A callable that takes the kernel results as well as some `*args` and\n `**kwargs` and returns a modified copy of those kernel results.\n\n Returns:\n new_setter: A wrapped `setter`.\n \"\"\"\n\n @functools.wraps(setter)\n def _new_setter(kernel_results, *args, **kwargs):\n \"\"\"Wrapped setter.\"\"\"\n results_stack = []\n while hasattr(kernel_results, 'inner_results'):\n results_stack.append(kernel_results)\n kernel_results = kernel_results.inner_results\n\n new_kernel_results = setter(kernel_results, *args, **kwargs)\n for outer_results in reversed(results_stack):\n new_kernel_results = outer_results._replace(\n inner_results=new_kernel_results)\n\n return new_kernel_results\n\n return _new_setter"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\nwrap a getter so it applies to the first results in kernel_results.", "response": "def make_innermost_getter(getter):\n \"\"\"Wraps a getter so it applies to the inner-most results in `kernel_results`.\n\n The wrapped getter unwraps `kernel_results` and returns the return value of\n `getter` called with the first results without an `inner_results` attribute.\n\n Args:\n getter: A callable that takes Kernel results and returns some value.\n\n Returns:\n new_getter: A wrapped `getter`.\n \"\"\"\n\n @functools.wraps(getter)\n def _new_getter(kernel_results, *args, **kwargs):\n \"\"\"Wrapped getter.\"\"\"\n results_stack = []\n while hasattr(kernel_results, 'inner_results'):\n results_stack.append(kernel_results)\n 
kernel_results = kernel_results.inner_results\n\n return getter(kernel_results, *args, **kwargs)\n\n return _new_getter"} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function for\nenabling the store_parameters_in_results parameter in a chain of kernels.", "response": "def enable_store_parameters_in_results(kernel):\n \"\"\"Enables the `store_parameters_in_results` parameter in a chain of kernels.\n\n This is a temporary utility for use during the transition period of the\n parameter storage methods.\n\n Args:\n kernel: A TransitionKernel.\n\n Returns:\n kernel: The same kernel, but recreated with `store_parameters_in_results`\n recursively set to `True` in its parameters and its inner kernels (as\n appropriate).\n \"\"\"\n kernel_stack = []\n while hasattr(kernel, 'parameters') and 'inner_kernel' in kernel.parameters:\n kernel_stack.append(kernel)\n kernel = kernel.parameters['inner_kernel']\n\n def _recreate_kernel(kernel, parameters):\n new_parameters = kernel.parameters.copy()\n new_parameters.update(parameters)\n if 'store_parameters_in_results' in new_parameters:\n new_parameters['store_parameters_in_results'] = True\n with deprecation.silence():\n return type(kernel)(**new_parameters)\n\n if hasattr(kernel, 'parameters'):\n kernel = _recreate_kernel(kernel, {})\n\n for outer_kernel in reversed(kernel_stack):\n outer_kernel = _recreate_kernel(outer_kernel, {'inner_kernel': kernel})\n kernel = outer_kernel\n\n return kernel"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\nreplace the rightmost dims in a Tensor with the same contents as input_shape except for the event dims in event_shape_in event_shape_out.", "response": "def _replace_event_shape_in_shape_tensor(\n input_shape, event_shape_in, event_shape_out, validate_args):\n \"\"\"Replaces the rightmost dims in a `Tensor` representing a shape.\n\n Args:\n input_shape: a rank-1 `Tensor` of integers\n event_shape_in: the event shape expected to be present in rightmost 
dims\n of `shape_in`.\n event_shape_out: the event shape with which to replace `event_shape_in` in\n the rightmost dims of `input_shape`.\n validate_args: Python `bool` indicating whether arguments should\n be checked for correctness.\n\n Returns:\n output_shape: A rank-1 integer `Tensor` with the same contents as\n `input_shape` except for the event dims, which are replaced with\n `event_shape_out`.\n \"\"\"\n output_tensorshape, is_validated = _replace_event_shape_in_tensorshape(\n tensorshape_util.constant_value_as_shape(input_shape),\n event_shape_in,\n event_shape_out)\n\n # TODO(b/124240153): Remove map(tf.identity, deps) once tf.function\n # correctly supports control_dependencies.\n validation_dependencies = (\n map(tf.identity, (event_shape_in, event_shape_out))\n if validate_args else ())\n\n if (tensorshape_util.is_fully_defined(output_tensorshape) and\n (is_validated or not validate_args)):\n with tf.control_dependencies(validation_dependencies):\n output_shape = tf.convert_to_tensor(\n value=output_tensorshape, name='output_shape', dtype_hint=tf.int32)\n return output_shape, output_tensorshape\n\n with tf.control_dependencies(validation_dependencies):\n event_shape_in_ndims = (\n tf.size(input=event_shape_in)\n if tensorshape_util.num_elements(event_shape_in.shape) is None else\n tensorshape_util.num_elements(event_shape_in.shape))\n input_non_event_shape, input_event_shape = tf.split(\n input_shape, num_or_size_splits=[-1, event_shape_in_ndims])\n\n additional_assertions = []\n if is_validated:\n pass\n elif validate_args:\n # Check that `input_event_shape` and `event_shape_in` are compatible in the\n # sense that they have equal entries in any position that isn't a `-1` in\n # `event_shape_in`. 
Note that our validations at construction time ensure\n # there is at most one such entry in `event_shape_in`.\n mask = event_shape_in >= 0\n explicit_input_event_shape = tf.boolean_mask(\n tensor=input_event_shape, mask=mask)\n explicit_event_shape_in = tf.boolean_mask(\n tensor=event_shape_in, mask=mask)\n additional_assertions.append(\n assert_util.assert_equal(\n explicit_input_event_shape,\n explicit_event_shape_in,\n message='Input `event_shape` does not match `event_shape_in`.'))\n # We don't explicitly additionally verify\n # `tf.size(input_shape) > tf.size(event_shape_in)` since `tf.split`\n # already makes this assertion.\n\n with tf.control_dependencies(additional_assertions):\n output_shape = tf.concat([input_non_event_shape, event_shape_out], axis=0,\n name='output_shape')\n\n return output_shape, output_tensorshape"} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\ncheck that a shape Tensor is int - type and otherwise sane.", "response": "def _maybe_check_valid_shape(shape, validate_args):\n \"\"\"Check that a shape Tensor is int-type and otherwise sane.\"\"\"\n if not dtype_util.is_integer(shape.dtype):\n raise TypeError('{} dtype ({}) should be `int`-like.'.format(\n shape, dtype_util.name(shape.dtype)))\n\n assertions = []\n\n message = '`{}` rank should be <= 1.'\n if tensorshape_util.rank(shape.shape) is not None:\n if tensorshape_util.rank(shape.shape) > 1:\n raise ValueError(message.format(shape))\n elif validate_args:\n assertions.append(assert_util.assert_less(\n tf.rank(shape), 2, message=message.format(shape)))\n\n shape_ = tf.get_static_value(shape)\n\n message = '`{}` elements must have at most one `-1`.'\n if shape_ is not None:\n if sum(shape_ == -1) > 1:\n raise ValueError(message.format(shape))\n elif validate_args:\n assertions.append(assert_util.assert_less(\n tf.reduce_sum(input_tensor=tf.cast(tf.equal(shape, -1), tf.int32)),\n 2,\n message=message.format(shape)))\n\n message = '`{}` elements must be 
either positive integers or `-1`.'\n if shape_ is not None:\n if np.any(shape_ < -1):\n raise ValueError(message.format(shape))\n elif validate_args:\n assertions.append(assert_util.assert_greater(\n shape, -2, message=message.format(shape)))\n\n return assertions"} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\ncalculate the batchwise KL divergence KL with d1 and d2 Beta.", "response": "def _kl_beta_beta(d1, d2, name=None):\n \"\"\"Calculate the batchwise KL divergence KL(d1 || d2) with d1 and d2 Beta.\n\n Args:\n d1: instance of a Beta distribution object.\n d2: instance of a Beta distribution object.\n name: (optional) Name to use for created operations.\n default is \"kl_beta_beta\".\n\n Returns:\n Batchwise KL(d1 || d2)\n \"\"\"\n def delta(fn, is_property=True):\n fn1 = getattr(d1, fn)\n fn2 = getattr(d2, fn)\n return (fn2 - fn1) if is_property else (fn2() - fn1())\n\n with tf.name_scope(name or \"kl_beta_beta\"):\n return (delta(\"_log_normalization\", is_property=False) -\n tf.math.digamma(d1.concentration1) * delta(\"concentration1\") -\n tf.math.digamma(d1.concentration0) * delta(\"concentration0\") +\n (tf.math.digamma(d1.total_concentration) *\n delta(\"total_concentration\")))"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _maybe_assert_valid_sample(self, x):\n if not self.validate_args:\n return x\n return distribution_util.with_dependencies([\n assert_util.assert_positive(x, message=\"sample must be positive\"),\n assert_util.assert_less(x, 1., message=\"sample must be less than `1`.\"),\n ], x)", "response": "Checks the validity of a sample."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef converged_any(converged, failed):\n return (tf.reduce_any(input_tensor=converged) |\n tf.reduce_all(input_tensor=failed))", "response": "Condition to stop when any batch member converges."} {"SOURCE": 
"codesearchnet", "instruction": "Create a Python 3 function to\nreturn a dictionary of initial state arguments for the current object.", "response": "def get_initial_state_args(value_and_gradients_function,\n initial_position,\n grad_tolerance,\n control_inputs=None):\n \"\"\"Returns a dictionary to populate the initial state of the search procedure.\n\n Performs an initial convergence check and the first evaluation of the\n objective function.\n\n Args:\n value_and_gradients_function: A Python callable that accepts a tensor and\n returns a tuple of two tensors: the objective function value and its\n derivative.\n initial_position: The starting point of the search procedure.\n grad_tolerance: The gradient tolerance for the procedure.\n control_inputs: Optional ops used to assert the validity of inputs; these\n are added as control dependencies to execute before the objective\n function is evaluated for the first time.\n\n Returns:\n A dictionary with values for the following keys:\n converged: True if the convergence check finds that the initial position\n is already an argmin of the objective function.\n failed: Initialized to False.\n num_objective_evaluations: Initialized to 1.\n position: Initialized to the initial position.\n objective_value: Initialized to the value of the objective function at\n the initial position.\n objective_gradient: Initialized to the gradient of the objective\n function at the initial position.\n \"\"\"\n if control_inputs:\n with tf.control_dependencies(control_inputs):\n f0, df0 = value_and_gradients_function(initial_position)\n else:\n f0, df0 = value_and_gradients_function(initial_position)\n converged = norm(df0, dims=1) < grad_tolerance\n return dict(\n converged=converged,\n failed=tf.zeros_like(converged), # i.e. 
False.\n num_iterations=tf.convert_to_tensor(value=0),\n num_objective_evaluations=tf.convert_to_tensor(value=1),\n position=initial_position,\n objective_value=f0,\n objective_gradient=df0)"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\nperforming the line search step of the BFGS search procedure. Uses hager_zhang line search procedure to compute a suitable step size to advance the current `state.position` along the given `search_direction`. Also, if the line search is successful, updates the `state.position` by taking the corresponding step. Args: state: A namedtuple instance holding values for the current state of the search procedure. The state must include the fields: `position`, `objective_value`, `objective_gradient`, `num_iterations`, `num_objective_evaluations`, `converged` and `failed`. value_and_gradients_function: A Python callable that accepts a point as a real `Tensor` of shape `[..., n]` and returns a tuple of two tensors of the same dtype: the objective function value, a real `Tensor` of shape `[...]`, and its derivative, another real `Tensor` of shape `[..., n]`. search_direction: A real `Tensor` of shape `[..., n]`. The direction along which to perform line search. grad_tolerance: Scalar `Tensor` of real dtype. Specifies the gradient tolerance for the procedure. f_relative_tolerance: Scalar `Tensor` of real dtype. Specifies the tolerance for the relative change in the objective value. x_tolerance: Scalar `Tensor` of real dtype. Specifies the tolerance for the change in the position. stopping_condition: A Python function that takes as input two Boolean tensors of shape `[...]`, and returns a Boolean scalar tensor. The input tensors are `converged` and `failed`, indicating the current status of each respective batch member; the return value states whether the algorithm should stop. 
Returns: A copy of the input state with the following fields updated: converged: a Boolean `Tensor` of shape `[...]` indicating whether the convergence criteria has been met. failed: a Boolean `Tensor` of shape `[...]` indicating whether the line search procedure failed to converge, or if either the updated gradient or objective function are no longer finite. num_iterations: Increased by 1. num_objective_evaluations: Increased by the number of times that the objective function got evaluated. position, objective_value, objective_gradient: If line search succeeded, updated by computing the new position and evaluating the objective function at that position.", "response": "def line_search_step(state, value_and_gradients_function, search_direction,\n grad_tolerance, f_relative_tolerance, x_tolerance,\n stopping_condition):\n \"\"\"Performs the line search step of the BFGS search procedure.\n\n Uses hager_zhang line search procedure to compute a suitable step size\n to advance the current `state.position` along the given `search_direction`.\n Also, if the line search is successful, updates the `state.position` by\n taking the corresponding step.\n\n Args:\n state: A namedtuple instance holding values for the current state of the\n search procedure. The state must include the fields: `position`,\n `objective_value`, `objective_gradient`, `num_iterations`,\n `num_objective_evaluations`, `converged` and `failed`.\n value_and_gradients_function: A Python callable that accepts a point as a\n real `Tensor` of shape `[..., n]` and returns a tuple of two tensors of\n the same dtype: the objective function value, a real `Tensor` of shape\n `[...]`, and its derivative, another real `Tensor` of shape `[..., n]`.\n search_direction: A real `Tensor` of shape `[..., n]`. The direction along\n which to perform line search.\n grad_tolerance: Scalar `Tensor` of real dtype. Specifies the gradient\n tolerance for the procedure.\n f_relative_tolerance: Scalar `Tensor` of real dtype. 
Specifies the\n tolerance for the relative change in the objective value.\n x_tolerance: Scalar `Tensor` of real dtype. Specifies the tolerance for the\n change in the position.\n stopping_condition: A Python function that takes as input two Boolean\n tensors of shape `[...]`, and returns a Boolean scalar tensor. The input\n tensors are `converged` and `failed`, indicating the current status of\n each respective batch member; the return value states whether the\n algorithm should stop.\n Returns:\n A copy of the input state with the following fields updated:\n converged: a Boolean `Tensor` of shape `[...]` indicating whether the\n convergence criteria has been met.\n failed: a Boolean `Tensor` of shape `[...]` indicating whether the line\n search procedure failed to converge, or if either the updated gradient\n or objective function are no longer finite.\n num_iterations: Increased by 1.\n num_objective_evaluations: Increased by the number of times that the\n objective function got evaluated.\n position, objective_value, objective_gradient: If line search succeeded,\n updated by computing the new position and evaluating the objective\n function at that position.\n \"\"\"\n line_search_value_grad_func = _restrict_along_direction(\n value_and_gradients_function, state.position, search_direction)\n derivative_at_start_pt = tf.reduce_sum(\n input_tensor=state.objective_gradient * search_direction, axis=-1)\n val_0 = ValueAndGradient(x=_broadcast(0, state.position),\n f=state.objective_value,\n df=derivative_at_start_pt,\n full_gradient=state.objective_gradient)\n inactive = state.failed | state.converged\n ls_result = linesearch.hager_zhang(\n line_search_value_grad_func,\n initial_step_size=_broadcast(1, state.position),\n value_at_zero=val_0,\n converged=inactive) # No search needed for these.\n\n state_after_ls = update_fields(\n state,\n failed=state.failed | ~ls_result.converged,\n num_iterations=state.num_iterations + 1,\n num_objective_evaluations=(\n 
state.num_objective_evaluations + ls_result.func_evals))\n\n def _do_update_position():\n # For inactive batch members `left.x` is zero. However, their\n # `search_direction` might also be undefined, so we can't rely on\n # multiplication by zero to produce a `position_delta` of zero.\n position_delta = tf.where(\n inactive,\n tf.zeros_like(search_direction),\n search_direction * tf.expand_dims(ls_result.left.x, axis=-1))\n return _update_position(\n state_after_ls,\n position_delta,\n ls_result.left.f,\n ls_result.left.full_gradient,\n grad_tolerance, f_relative_tolerance, x_tolerance)\n\n return prefer_static.cond(\n stopping_condition(state.converged, state.failed),\n true_fn=lambda: state_after_ls,\n false_fn=_do_update_position)"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _restrict_along_direction(value_and_gradients_function,\n position,\n direction):\n \"\"\"Restricts a function in n-dimensions to a given direction.\n\n Suppose f: R^n -> R. Then given a point x0 and a vector p0 in R^n, the\n restriction of the function along that direction is defined by:\n\n ```None\n g(t) = f(x0 + t * p0)\n ```\n\n This function performs this restriction on the given function. In addition, it\n also computes the gradient of the restricted function along the restriction\n direction. This is equivalent to computing `dg/dt` in the definition above.\n\n Args:\n value_and_gradients_function: Callable accepting a single real `Tensor`\n argument of shape `[..., n]` and returning a tuple of a real `Tensor` of\n shape `[...]` and a real `Tensor` of shape `[..., n]`. The multivariate\n function whose restriction is to be computed. The output values of the\n callable are the function value and the gradients at the input argument.\n position: `Tensor` of real dtype and shape consumable by\n `value_and_gradients_function`. Corresponds to `x0` in the definition\n above.\n direction: `Tensor` of the same dtype and shape as `position`. 
The direction\n along which to restrict the function. Note that the direction need not\n be a unit vector.\n\n Returns:\n restricted_value_and_gradients_func: A callable accepting a tensor of shape\n broadcastable to `[...]` and same dtype as `position` and returning a\n namedtuple of `Tensors`. The input tensor is the parameter along the\n direction labelled `t` above. The return value contains fields:\n x: A real `Tensor` of shape `[...]`. The input value `t` where the line\n function was evaluated, after any necessary broadcasting.\n f: A real `Tensor` of shape `[...]` containing the value of the\n function at the point `position + t * direction`.\n df: A real `Tensor` of shape `[...]` containing the derivative at\n `position + t * direction`.\n full_gradient: A real `Tensor` of shape `[..., n]`, the full gradient\n of the original `value_and_gradients_function`.\n \"\"\"\n def _restricted_func(t):\n t = _broadcast(t, position)\n pt = position + tf.expand_dims(t, axis=-1) * direction\n objective_value, gradient = value_and_gradients_function(pt)\n return ValueAndGradient(\n x=t,\n f=objective_value,\n df=tf.reduce_sum(input_tensor=gradient * direction, axis=-1),\n full_gradient=gradient)\n\n return _restricted_func", "response": "Restrict a function to a given direction."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _update_position(state,\n position_delta,\n next_objective,\n next_gradient,\n grad_tolerance,\n f_relative_tolerance,\n x_tolerance):\n \"\"\"Updates the state advancing its position by a given position_delta.\"\"\"\n failed = state.failed | ~tf.math.is_finite(next_objective) | ~tf.reduce_all(\n input_tensor=tf.math.is_finite(next_gradient), axis=-1)\n\n next_position = state.position + position_delta\n converged = ~failed & _check_convergence(state.position,\n next_position,\n state.objective_value,\n next_objective,\n next_gradient,\n grad_tolerance,\n f_relative_tolerance,\n 
x_tolerance)\n return update_fields(\n state,\n converged=state.converged | converged,\n failed=failed,\n position=next_position,\n objective_value=next_objective,\n objective_gradient=next_gradient)", "response": "Updates the state advancing its position by a given position_delta."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef norm(value, dims, order=None):\n if dims == 0:\n return tf.math.abs(value)\n elif dims == 1:\n axis = -1\n elif dims == 2:\n axis = [-1, -2]\n else:\n ValueError(dims)\n if order is None:\n order = np.inf\n return tf.norm(tensor=value, axis=axis, ord=order)", "response": "Compute the norm of the given value."} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef _check_convergence(current_position,\n next_position,\n current_objective,\n next_objective,\n next_gradient,\n grad_tolerance,\n f_relative_tolerance,\n x_tolerance):\n \"\"\"Checks if the algorithm satisfies the convergence criteria.\"\"\"\n grad_converged = norm(next_gradient, dims=1) <= grad_tolerance\n x_converged = norm(next_position - current_position, dims=1) <= x_tolerance\n f_converged = (norm(next_objective - current_objective, dims=0) <=\n f_relative_tolerance * current_objective)\n return grad_converged | x_converged | f_converged", "response": "Checks if the algorithm satisfies the convergence criteria."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef _broadcast(value, target):\n return tf.broadcast_to(\n tf.convert_to_tensor(value=value, dtype=target.dtype),\n distribution_util.prefer_static_shape(target)[:-1])", "response": "Broadcasts a value to match the batching dimensions of a target."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _harmonic_number(x):\n one = tf.ones([], dtype=x.dtype)\n return 
tf.math.digamma(x + one) - tf.math.digamma(one)", "response": "Compute the harmonic number from its analytic continuation."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef _moment(self, n):\n total_concentration = self.concentration1 + self.concentration0\n expanded_concentration1 = tf.ones_like(\n total_concentration, dtype=self.dtype) * self.concentration1\n expanded_concentration0 = tf.ones_like(\n total_concentration, dtype=self.dtype) * self.concentration0\n beta_arg0 = 1 + n / expanded_concentration1\n beta_arg = tf.stack([beta_arg0, expanded_concentration0], -1)\n log_moment = tf.math.log(expanded_concentration0) + tf.math.lbeta(beta_arg)\n return tf.exp(log_moment)", "response": "Compute the nth uncentered moment."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _maybe_validate_target_accept_prob(target_accept_prob, validate_args):\n if not validate_args:\n return target_accept_prob\n with tf.control_dependencies([\n tf.compat.v1.assert_positive(\n target_accept_prob, message='`target_accept_prob` must be > 0.'),\n tf.compat.v1.assert_less(\n target_accept_prob,\n tf.ones_like(target_accept_prob),\n message='`target_accept_prob` must be < 1.')\n ]):\n return tf.identity(target_accept_prob)", "response": "Validates that target_accept_prob is in the range (0, 1)."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\nbuild the default exchange proposal function for replica exchange MC.", "response": "def default_exchange_proposed_fn(prob_exchange):\n \"\"\"Default exchange proposal function, for replica exchange MC.\n\n With probability `prob_exchange` propose combinations of replica for exchange.\n When exchanging, create combinations of adjacent replicas in\n [Replica Exchange Monte Carlo](\n https://en.wikipedia.org/wiki/Parallel_tempering)\n\n ```\n exchange_fn = default_exchange_proposed_fn(prob_exchange=0.5)\n exchange_proposed 
= exchange_fn(num_replica=3)\n\n exchange_proposed.eval()\n ==> [[0, 1]] # 1 exchange, 0 <--> 1\n\n exchange_proposed.eval()\n ==> [] # 0 exchanges\n ```\n\n Args:\n prob_exchange: Scalar `Tensor` giving probability that any exchanges will\n be generated.\n\n Returns:\n default_exchange_proposed_fn_: Python callable which take a number of\n replicas (a Python integer), and return combinations of replicas for\n exchange as an [n, 2] integer `Tensor`, `0 <= n <= num_replica // 2`,\n with *unique* values in the set `{0, ..., num_replica}`.\n \"\"\"\n\n def default_exchange_proposed_fn_(num_replica, seed=None):\n \"\"\"Default function for `exchange_proposed_fn` of `kernel`.\"\"\"\n seed_stream = distributions.SeedStream(seed, 'default_exchange_proposed_fn')\n\n zero_start = tf.random.uniform([], seed=seed_stream()) > 0.5\n if num_replica % 2 == 0:\n\n def _exchange():\n flat_exchange = tf.range(num_replica)\n if num_replica > 2:\n start = tf.cast(~zero_start, dtype=tf.int32)\n end = num_replica - start\n flat_exchange = flat_exchange[start:end]\n return tf.reshape(flat_exchange, [tf.size(input=flat_exchange) // 2, 2])\n else:\n\n def _exchange():\n start = tf.cast(zero_start, dtype=tf.int32)\n end = num_replica - tf.cast(~zero_start, dtype=tf.int32)\n flat_exchange = tf.range(num_replica)[start:end]\n return tf.reshape(flat_exchange, [tf.size(input=flat_exchange) // 2, 2])\n\n def _null_exchange():\n return tf.reshape(tf.cast([], dtype=tf.int32), shape=[0, 2])\n\n return tf.cond(\n pred=tf.random.uniform([], seed=seed_stream()) < prob_exchange,\n true_fn=_exchange,\n false_fn=_null_exchange)\n\n return default_exchange_proposed_fn_"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _get_field(kernel_results, field_name):\n if hasattr(kernel_results, field_name):\n return getattr(kernel_results, field_name)\n if hasattr(kernel_results, 'accepted_results'):\n return getattr(kernel_results.accepted_results, field_name)\n 
raise TypeError('Cannot extract %s from %s' % (field_name, kernel_results))", "response": "Get a field from kernel_results or kernel_results. accepted_results."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\ntake one step of the TransitionKernel.", "response": "def one_step(self, current_state, previous_kernel_results):\n \"\"\"Takes one step of the TransitionKernel.\n\n Args:\n current_state: `Tensor` or Python `list` of `Tensor`s representing the\n current state(s) of the Markov chain(s).\n previous_kernel_results: A (possibly nested) `tuple`, `namedtuple` or\n `list` of `Tensor`s representing internal calculations made within the\n previous call to this function (or as returned by `bootstrap_results`).\n\n Returns:\n next_state: `Tensor` or Python `list` of `Tensor`s representing the\n next state(s) of the Markov chain(s).\n kernel_results: A (possibly nested) `tuple`, `namedtuple` or `list` of\n `Tensor`s representing internal calculations made within this function.\n This inculdes replica states.\n \"\"\"\n # Key difficulty: The type of exchanges differs from one call to the\n # next...even the number of exchanges can differ.\n # As a result, exchanges must happen dynamically, in while loops.\n with tf.compat.v1.name_scope(\n name=mcmc_util.make_name(self.name, 'remc', 'one_step'),\n values=[current_state, previous_kernel_results]):\n\n # Each replica does `one_step` to get pre-exchange states/KernelResults.\n sampled_replica_states, sampled_replica_results = zip(*[\n rk.one_step(previous_kernel_results.replica_states[i],\n previous_kernel_results.replica_results[i])\n for i, rk in enumerate(self.replica_kernels)\n ])\n sampled_replica_states = list(sampled_replica_states)\n sampled_replica_results = list(sampled_replica_results)\n\n states_are_lists = mcmc_util.is_list_like(sampled_replica_states[0])\n\n if not states_are_lists:\n sampled_replica_states = [[s] for s in sampled_replica_states]\n num_state_parts = 
len(sampled_replica_states[0])\n\n dtype = sampled_replica_states[0][0].dtype\n\n # Must put states into TensorArrays. Why? We will read/write states\n # dynamically with Tensor index `i`, and you cannot do this with lists.\n # old_states[k][i] is Tensor of (old) state part k, for replica i.\n # The `k` will be known statically, and `i` is a Tensor.\n old_states = [\n tf.TensorArray(\n dtype,\n size=self.num_replica,\n dynamic_size=False,\n clear_after_read=False,\n tensor_array_name='old_states',\n # State part k has same shape, regardless of replica. So use 0.\n element_shape=sampled_replica_states[0][k].shape)\n for k in range(num_state_parts)\n ]\n for k in range(num_state_parts):\n for i in range(self.num_replica):\n old_states[k] = old_states[k].write(i, sampled_replica_states[i][k])\n\n exchange_proposed = self.exchange_proposed_fn(\n self.num_replica, seed=self._seed_stream())\n exchange_proposed_n = tf.shape(input=exchange_proposed)[0]\n\n exchanged_states = self._get_exchanged_states(\n old_states, exchange_proposed, exchange_proposed_n,\n sampled_replica_states, sampled_replica_results)\n\n no_exchange_proposed, _ = tf.compat.v1.setdiff1d(\n tf.range(self.num_replica), tf.reshape(exchange_proposed, [-1]))\n\n exchanged_states = self._insert_old_states_where_no_exchange_was_proposed(\n no_exchange_proposed, old_states, exchanged_states)\n\n next_replica_states = []\n for i in range(self.num_replica):\n next_replica_states_i = []\n for k in range(num_state_parts):\n next_replica_states_i.append(exchanged_states[k].read(i))\n next_replica_states.append(next_replica_states_i)\n\n if not states_are_lists:\n next_replica_states = [s[0] for s in next_replica_states]\n sampled_replica_states = [s[0] for s in sampled_replica_states]\n\n # Now that states are/aren't exchanged, bootstrap next kernel_results.\n # The viewpoint is that after each exchange, we are starting anew.\n next_replica_results = [\n rk.bootstrap_results(state)\n for rk, state in 
zip(self.replica_kernels, next_replica_states)\n ]\n\n next_state = next_replica_states[0] # Replica 0 is the returned state(s).\n\n kernel_results = ReplicaExchangeMCKernelResults(\n replica_states=next_replica_states,\n replica_results=next_replica_results,\n sampled_replica_states=sampled_replica_states,\n sampled_replica_results=sampled_replica_results,\n )\n\n return next_state, kernel_results"} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\ngetting the exchanged states.", "response": "def _get_exchanged_states(self, old_states, exchange_proposed,\n exchange_proposed_n, sampled_replica_states,\n sampled_replica_results):\n \"\"\"Get list of TensorArrays holding exchanged states, and zeros.\"\"\"\n with tf.compat.v1.name_scope('get_exchanged_states'):\n\n target_log_probs = []\n for replica in range(self.num_replica):\n replica_log_prob = _get_field(sampled_replica_results[replica],\n 'target_log_prob')\n inverse_temp = self.inverse_temperatures[replica]\n target_log_probs.append(replica_log_prob / inverse_temp)\n target_log_probs = tf.stack(target_log_probs, axis=0)\n\n dtype = target_log_probs.dtype\n num_state_parts = len(sampled_replica_states[0])\n # exchanged_states[k][i] is Tensor of (new) state part k, for replica i.\n # The `k` will be known statically, and `i` is a Tensor.\n # We will insert values into indices `i` for every replica with a proposed\n # exchange.\n exchanged_states = [\n tf.TensorArray(\n dtype,\n size=self.num_replica,\n dynamic_size=False,\n tensor_array_name='exchanged_states',\n # State part k has same shape, regardless of replica. So use 0.\n element_shape=sampled_replica_states[0][k].shape)\n for k in range(num_state_parts)\n ]\n\n # Draw random variables here, to avoid sampling in the loop (and losing\n # reproducibility). 
This may mean we sample too many, but we will always\n # have enough.\n sample_shape = tf.concat(\n ([self.num_replica // 2], tf.shape(input=target_log_probs)[1:]),\n axis=0)\n log_uniforms = tf.math.log(\n tf.random.uniform(\n shape=sample_shape, dtype=dtype, seed=self._seed_stream()))\n\n def _swap(is_exchange_accepted, x, y):\n \"\"\"Swap batches of x, y where accepted.\"\"\"\n with tf.compat.v1.name_scope('swap_where_exchange_accepted'):\n new_x = mcmc_util.choose(is_exchange_accepted, y, x)\n new_y = mcmc_util.choose(is_exchange_accepted, x, y)\n return new_x, new_y\n\n def cond(i, unused_exchanged_states):\n return i < exchange_proposed_n\n\n def body(i, exchanged_states):\n \"\"\"Body of while loop for exchanging states.\"\"\"\n # Propose exchange between replicas indexed by m and n.\n m, n = tf.unstack(exchange_proposed[i])\n\n # Construct log_accept_ratio: -temp_diff * target_log_prob_diff.\n # Note target_log_prob_diff = -EnergyDiff (common definition is in terms\n # of energy).\n temp_diff = self.inverse_temperatures[m] - self.inverse_temperatures[n]\n # Difference of target log probs may be +- Inf or NaN. 
We want the\n # product of this with the temperature difference to have \"alt value\" of\n # -Inf.\n log_accept_ratio = mcmc_util.safe_sum(\n [-temp_diff * target_log_probs[m], temp_diff * target_log_probs[n]])\n\n is_exchange_accepted = log_uniforms[i] < log_accept_ratio\n\n for k in range(num_state_parts):\n new_m, new_n = _swap(is_exchange_accepted, old_states[k].read(m),\n old_states[k].read(n))\n exchanged_states[k] = exchanged_states[k].write(m, new_m)\n exchanged_states[k] = exchanged_states[k].write(n, new_n)\n\n return i + 1, exchanged_states\n\n # At this point, exchanged_states[k] is a length num_replicas TensorArray.\n return tf.while_loop(\n cond=cond, body=body, loop_vars=[tf.constant(0),\n exchanged_states])[1]"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef bootstrap_results(self, init_state):\n with tf.compat.v1.name_scope(\n name=mcmc_util.make_name(self.name, 'remc', 'bootstrap_results'),\n values=[init_state]):\n replica_results = [\n self.replica_kernels[i].bootstrap_results(init_state)\n for i in range(self.num_replica)\n ]\n\n init_state_parts = (\n list(init_state)\n if mcmc_util.is_list_like(init_state) else [init_state])\n\n # Convert all states parts to tensor...\n replica_states = [[\n tf.convert_to_tensor(value=s) for s in init_state_parts\n ] for i in range(self.num_replica)]\n\n if not mcmc_util.is_list_like(init_state):\n replica_states = [s[0] for s in replica_states]\n\n return ReplicaExchangeMCKernelResults(\n replica_states=replica_states,\n replica_results=replica_results,\n sampled_replica_states=replica_states,\n sampled_replica_results=replica_results,\n )", "response": "Returns an object with the same type as returned by one_step."} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef _variance_scale_term(self):\n # Expand back the last dim so the shape of _variance_scale_term matches the\n # shape 
of self.concentration.\n c0 = self.total_concentration[..., tf.newaxis]\n return tf.sqrt((1. + c0 / self.total_count[..., tf.newaxis]) / (1. + c0))", "response": "Helper to _covariance and _variance which computes a shared scale."} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\nchecks the validity of the concentration parameter.", "response": "def _maybe_assert_valid_concentration(self, concentration, validate_args):\n \"\"\"Checks the validity of the concentration parameter.\"\"\"\n if not validate_args:\n return concentration\n concentration = distribution_util.embed_check_categorical_event_shape(\n concentration)\n return distribution_util.with_dependencies([\n assert_util.assert_positive(\n concentration, message=\"Concentration parameter must be positive.\"),\n ], concentration)"} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\nchecks counts for proper shape values then return tensor version.", "response": "def _maybe_assert_valid_sample(self, counts):\n \"\"\"Check counts for proper shape, values, then return tensor version.\"\"\"\n if not self.validate_args:\n return counts\n counts = distribution_util.embed_check_nonnegative_integer_form(counts)\n return distribution_util.with_dependencies([\n assert_util.assert_equal(\n self.total_count,\n tf.reduce_sum(input_tensor=counts, axis=-1),\n message=\"counts last-dimension must sum to `self.total_count`\"),\n ], counts)"} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function for\nmaking a function which applies a list of Bijectors s log_det_jacobians.", "response": "def forward_log_det_jacobian_fn(bijector):\n \"\"\"Makes a function which applies a list of Bijectors' `log_det_jacobian`s.\"\"\"\n if not mcmc_util.is_list_like(bijector):\n bijector = [bijector]\n\n def fn(transformed_state_parts, event_ndims):\n return sum([\n b.forward_log_det_jacobian(sp, event_ndims=e)\n for b, e, sp in zip(bijector, 
event_ndims, transformed_state_parts)\n ])\n\n return fn"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef forward_transform_fn(bijector):\n if not mcmc_util.is_list_like(bijector):\n bijector = [bijector]\n\n def fn(transformed_state_parts):\n return [b.forward(sp) for b, sp in zip(bijector, transformed_state_parts)]\n\n return fn", "response": "Makes a function which applies a list of Bijectors s forward s."} {"SOURCE": "codesearchnet", "instruction": "Can you create a Python 3 function that\nmakes a function which applies a list of Bijectors s inverse s.", "response": "def inverse_transform_fn(bijector):\n \"\"\"Makes a function which applies a list of Bijectors' `inverse`s.\"\"\"\n if not mcmc_util.is_list_like(bijector):\n bijector = [bijector]\n def fn(state_parts):\n return [b.inverse(sp)\n for b, sp in zip(bijector, state_parts)]\n return fn"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef one_step(self, current_state, previous_kernel_results):\n with tf.compat.v1.name_scope(\n name=mcmc_util.make_name(self.name, 'transformed_kernel', 'one_step'),\n values=[previous_kernel_results]):\n transformed_next_state, kernel_results = self._inner_kernel.one_step(\n previous_kernel_results.transformed_state,\n previous_kernel_results.inner_results)\n transformed_next_state_parts = (\n transformed_next_state\n if mcmc_util.is_list_like(transformed_next_state) else\n [transformed_next_state])\n next_state_parts = self._forward_transform(transformed_next_state_parts)\n next_state = (\n next_state_parts if mcmc_util.is_list_like(transformed_next_state)\n else next_state_parts[0])\n kernel_results = TransformedTransitionKernelResults(\n transformed_state=transformed_next_state,\n inner_results=kernel_results)\n return next_state, kernel_results", "response": "Runs one iteration of the Transformed Kernel."} {"SOURCE": "codesearchnet", "instruction": 
"Create a Python 3 function for\nreturning a tuple containing the bootstrap results of the current state of the Markov chain.", "response": "def bootstrap_results(self, init_state=None, transformed_init_state=None):\n \"\"\"Returns an object with the same type as returned by `one_step`.\n\n Unlike other `TransitionKernel`s,\n `TransformedTransitionKernel.bootstrap_results` has the option of\n initializing the `TransformedTransitionKernelResults` from either an initial\n state, eg, requiring computing `bijector.inverse(init_state)`, or\n directly from `transformed_init_state`, i.e., a `Tensor` or list\n of `Tensor`s which is interpretted as the `bijector.inverse`\n transformed state.\n\n Args:\n init_state: `Tensor` or Python `list` of `Tensor`s representing the a\n state(s) of the Markov chain(s). Must specify `init_state` or\n `transformed_init_state` but not both.\n transformed_init_state: `Tensor` or Python `list` of `Tensor`s\n representing the a state(s) of the Markov chain(s). Must specify\n `init_state` or `transformed_init_state` but not both.\n\n Returns:\n kernel_results: A (possibly nested) `tuple`, `namedtuple` or `list` of\n `Tensor`s representing internal calculations made within this function.\n\n Raises:\n ValueError: if `inner_kernel` results doesn't contain the member\n \"target_log_prob\".\n\n #### Examples\n\n To use `transformed_init_state` in context of\n `tfp.mcmc.sample_chain`, you need to explicitly pass the\n `previous_kernel_results`, e.g.,\n\n ```python\n transformed_kernel = tfp.mcmc.TransformedTransitionKernel(...)\n init_state = ... # Doesnt matter.\n transformed_init_state = ... 
# Does matter.\n results, _ = tfp.mcmc.sample_chain(\n num_results=...,\n current_state=init_state,\n previous_kernel_results=transformed_kernel.bootstrap_results(\n transformed_init_state=transformed_init_state),\n kernel=transformed_kernel)\n ```\n \"\"\"\n if (init_state is None) == (transformed_init_state is None):\n raise ValueError('Must specify exactly one of `init_state` '\n 'or `transformed_init_state`.')\n with tf.compat.v1.name_scope(\n name=mcmc_util.make_name(self.name, 'transformed_kernel',\n 'bootstrap_results'),\n values=[init_state, transformed_init_state]):\n if transformed_init_state is None:\n init_state_parts = (init_state if mcmc_util.is_list_like(init_state)\n else [init_state])\n transformed_init_state_parts = self._inverse_transform(init_state_parts)\n transformed_init_state = (\n transformed_init_state_parts if mcmc_util.is_list_like(init_state)\n else transformed_init_state_parts[0])\n else:\n if mcmc_util.is_list_like(transformed_init_state):\n transformed_init_state = [\n tf.convert_to_tensor(value=s, name='transformed_init_state')\n for s in transformed_init_state\n ]\n else:\n transformed_init_state = tf.convert_to_tensor(\n value=transformed_init_state, name='transformed_init_state')\n kernel_results = TransformedTransitionKernelResults(\n transformed_state=transformed_init_state,\n inner_results=self._inner_kernel.bootstrap_results(\n transformed_init_state))\n return kernel_results"} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\nlikes tf. 
where but works on namedtuples.", "response": "def val_where(cond, tval, fval):\n \"\"\"Like tf.where but works on namedtuples.\"\"\"\n if isinstance(tval, tf.Tensor):\n return tf.where(cond, tval, fval)\n elif isinstance(tval, tuple):\n cls = type(tval)\n return cls(*(val_where(cond, t, f) for t, f in zip(tval, fval)))\n else:\n raise Exception(TypeError)"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef secant2(value_and_gradients_function,\n val_0,\n search_interval,\n f_lim,\n sufficient_decrease_param=0.1,\n curvature_param=0.9,\n name=None):\n \"\"\"Performs the secant square procedure of Hager Zhang.\n\n Given an interval that brackets a root, this procedure performs an update of\n both end points using two intermediate points generated using the secant\n interpolation. For details see the steps S1-S4 in [Hager and Zhang (2006)][2].\n\n The interval [a, b] must satisfy the opposite slope conditions described in\n the documentation for `update`.\n\n Args:\n value_and_gradients_function: A Python callable that accepts a real scalar\n tensor and returns an object that can be converted to a namedtuple.\n The namedtuple should have fields 'f' and 'df' that correspond to scalar\n tensors of real dtype containing the value of the function and its\n derivative at that point. The other namedtuple fields, if present,\n should be tensors or sequences (possibly nested) of tensors.\n In usual optimization application, this function would be generated by\n projecting the multivariate objective function along some specific\n direction. The direction is determined by some other procedure but should\n be a descent direction (i.e. the derivative of the projected univariate\n function must be negative at 0.).\n Alternatively, the function may represent the batching of `n` such line\n functions (e.g. 
projecting a single multivariate objective function along\n `n` distinct directions at once) accepting n points as input, i.e. a\n tensor of shape [n], and the fields 'f' and 'df' in the returned\n namedtuple should each be a tensor of shape [n], with the corresponding\n function values and derivatives at the input points.\n val_0: A namedtuple, as returned by value_and_gradients_function evaluated\n at `0.`.\n search_interval: A namedtuple describing the current search interval,\n must include the fields:\n - converged: Boolean `Tensor` of shape [n], indicating batch members\n where search has already converged. Interval for these batch members\n won't be modified.\n - failed: Boolean `Tensor` of shape [n], indicating batch members\n where search has already failed. Interval for these batch members\n wont be modified.\n - iterations: Scalar int32 `Tensor`. Number of line search iterations\n so far.\n - func_evals: Scalar int32 `Tensor`. Number of function evaluations\n so far.\n - left: A namedtuple, as returned by value_and_gradients_function,\n of the left end point of the current search interval.\n - right: A namedtuple, as returned by value_and_gradients_function,\n of the right end point of the current search interval.\n f_lim: Scalar `Tensor` of real dtype. The function value threshold for\n the approximate Wolfe conditions to be checked.\n sufficient_decrease_param: Positive scalar `Tensor` of real dtype.\n Bounded above by the curvature param. Corresponds to 'delta' in the\n terminology of [Hager and Zhang (2006)][2].\n curvature_param: Positive scalar `Tensor` of real dtype. Bounded above\n by `1.`. Corresponds to 'sigma' in the terminology of\n [Hager and Zhang (2006)][2].\n name: (Optional) Python str. The name prefixed to the ops created by this\n function. If not supplied, the default name 'secant2' is used.\n\n Returns:\n A namedtuple containing the following fields.\n active: A boolean `Tensor` of shape [n]. 
Used internally by the procedure\n to indicate batch members on which there is work left to do.\n converged: A boolean `Tensor` of shape [n]. Indicates whether a point\n satisfying the Wolfe conditions has been found. If this is True, the\n interval will be degenerate (i.e. `left` and `right` below will be\n identical).\n failed: A boolean `Tensor` of shape [n]. Indicates if invalid function or\n gradient values were encountered (i.e. infinity or NaNs).\n num_evals: A scalar int32 `Tensor`. The total number of function\n evaluations made.\n left: Return value of value_and_gradients_function at the updated left\n end point of the interval.\n right: Return value of value_and_gradients_function at the updated right\n end point of the interval.\n \"\"\"\n with tf.compat.v1.name_scope(name, 'secant2', [\n val_0, search_interval, f_lim, sufficient_decrease_param,\n curvature_param]):\n # This will always be s.t. left <= c <= right\n val_c = value_and_gradients_function(\n _secant(search_interval.left, search_interval.right))\n failed = search_interval.failed | ~is_finite(val_c)\n converged = search_interval.converged | (~failed & _satisfies_wolfe(\n val_0, val_c, f_lim, sufficient_decrease_param, curvature_param))\n new_converged = converged & ~search_interval.converged\n val_left = val_where(new_converged, val_c, search_interval.left)\n val_right = val_where(new_converged, val_c, search_interval.right)\n\n initial_args = _Secant2Result(\n active=~failed & ~converged,\n converged=converged,\n failed=failed,\n num_evals=search_interval.func_evals + 1,\n left=val_left,\n right=val_right)\n\n def _apply_secant2_inner():\n return _secant2_inner(\n value_and_gradients_function,\n initial_args,\n val_0,\n val_c,\n f_lim,\n sufficient_decrease_param,\n curvature_param)\n\n return prefer_static.cond(\n tf.reduce_any(input_tensor=initial_args.active),\n _apply_secant2_inner,\n lambda: initial_args)", "response": "This function performs the secant square procedure of Hager 
Zhang."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _secant2_inner(value_and_gradients_function,\n initial_args,\n val_0,\n val_c,\n f_lim,\n sufficient_decrease_param,\n curvature_param):\n \"\"\"Helper function for secant square.\"\"\"\n # Apply the `update` function on active branch members to squeeze their\n # bracketing interval.\n update_result = update(value_and_gradients_function,\n initial_args.left,\n initial_args.right,\n val_c,\n f_lim,\n active=initial_args.active)\n\n # Update active and failed flags, update left/right on non-failed entries.\n active = initial_args.active & ~update_result.failed\n failed = initial_args.failed | update_result.failed\n val_left = val_where(active, update_result.left, initial_args.left)\n val_right = val_where(active, update_result.right, initial_args.right)\n\n # Check if new `c` points should be generated.\n updated_left = active & tf.equal(val_left.x, val_c.x)\n updated_right = active & tf.equal(val_right.x, val_c.x)\n is_new = updated_left | updated_right\n\n next_c = tf.where(updated_left,\n _secant(initial_args.left, val_left),\n val_c.x)\n next_c = tf.where(updated_right,\n _secant(initial_args.right, val_right),\n next_c)\n in_range = (val_left.x <= next_c) & (next_c <= val_right.x)\n\n # Figure out if an extra function evaluation is needed for new `c` points.\n needs_extra_eval = tf.reduce_any(input_tensor=in_range & is_new)\n num_evals = initial_args.num_evals + update_result.num_evals\n num_evals = num_evals + tf.cast(needs_extra_eval, num_evals.dtype)\n\n next_args = _Secant2Result(\n active=active & in_range, # No longer active if `c` is out of range.\n converged=initial_args.converged,\n failed=failed,\n num_evals=num_evals,\n left=val_left,\n right=val_right)\n\n def _apply_inner_update():\n next_val_c = prefer_static.cond(\n needs_extra_eval,\n (lambda: value_and_gradients_function(next_c)),\n (lambda: val_c))\n return 
_secant2_inner_update(\n value_and_gradients_function, next_args, val_0, next_val_c, f_lim,\n sufficient_decrease_param, curvature_param)\n\n return prefer_static.cond(\n tf.reduce_any(input_tensor=next_args.active),\n _apply_inner_update,\n lambda: next_args)", "response": "Internal function for secant 2."} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\nsqueezes a bracketing interval containing the minimum. Given an interval which brackets a minimum and a point in that interval, finds a smaller nested interval which also brackets the minimum. If the supplied point does not lie in the bracketing interval, the current interval is returned. The following description is given in terms of individual points evaluated on a line function to be minimized. Note, however, the implementation also accepts batches of points allowing to minimize multiple line functions at once. See details on the docstring of `value_and_gradients_function` below. The requirement of the interval bracketing a minimum is expressed through the opposite slope conditions. Assume the left end point is 'a', the right end point is 'b', the function to be minimized is 'f' and the derivative is 'df'. The update procedure relies on the following conditions being satisfied: ''' f(a) <= f(0) + epsilon (1) df(a) < 0 (2) df(b) > 0 (3) ''' In the first condition, epsilon is a small positive constant. The condition demands that the function at the left end point be not much bigger than the starting point (i.e. 0). This is an easy to satisfy condition because by assumption, we are in a direction where the function value is decreasing. The second and third conditions together demand that there is at least one zero of the derivative in between a and b. In addition to the interval, the update algorithm requires a third point to be supplied. Usually, this point would lie within the interval [a, b]. 
If the point is outside this interval, the current interval is returned. If the point lies within the interval, the behaviour of the function and derivative value at this point is used to squeeze the original interval in a manner that preserves the opposite slope conditions. For further details of this component, see the procedure U0-U3 on page 123 of the [Hager and Zhang (2006)][2] article. Note that this function does not explicitly verify whether the opposite slope conditions are satisfied for the supplied interval. It is assumed that this is so. Args: value_and_gradients_function: A Python callable that accepts a real scalar tensor and returns an object that can be converted to a namedtuple. The namedtuple should have fields 'f' and 'df' that correspond to scalar tensors of real dtype containing the value of the function and its derivative at that point. The other namedtuple fields, if present, should be tensors or sequences (possibly nested) of tensors. In usual optimization application, this function would be generated by projecting the multivariate objective function along some specific direction. The direction is determined by some other procedure but should be a descent direction (i.e. the derivative of the projected univariate function must be negative at 0.). Alternatively, the function may represent the batching of `n` such line functions (e.g. projecting a single multivariate objective function along `n` distinct directions at once) accepting n points as input, i.e. a tensor of shape [n], and the fields 'f' and 'df' in the returned namedtuple should each be a tensor of shape [n], with the corresponding function values and derivatives at the input points. val_left: Return value of value_and_gradients_function at the left end point of the bracketing interval (labelles 'a' above). val_right: Return value of value_and_gradients_function at the right end point of the bracketing interval (labelles 'b' above). 
val_trial: Return value of value_and_gradients_function at the trial point to be used to shrink the interval (labelled 'c' above). f_lim: real `Tensor` of shape [n]. The function value threshold for the approximate Wolfe conditions to be checked for each batch member. active: optional boolean `Tensor` of shape [n]. Relevant in batching mode only, indicates batch members on which the update procedure should be applied. On non-active members the current left/right interval is returned unmodified. Returns: A namedtuple containing the following fields: iteration: An int32 scalar `Tensor`. The number of iterations performed by the bisect algorithm. stopped: A boolean `Tensor` of shape [n]. True for those batch members where the bisection algorithm terminated. failed: A boolean `Tensor` of shape [n]. True for those batch members where an error was encountered. num_evals: An int32 scalar `Tensor`. The number of times the objective function was evaluated. left: Return value of value_and_gradients_function at the updated left end point of the interval found. right: Return value of value_and_gradients_function at the updated right end point of the interval found.", "response": "def update(value_and_gradients_function, val_left, val_right, val_trial, f_lim,\n active=None):\n \"\"\"Squeezes a bracketing interval containing the minimum.\n\n Given an interval which brackets a minimum and a point in that interval,\n finds a smaller nested interval which also brackets the minimum. If the\n supplied point does not lie in the bracketing interval, the current interval\n is returned.\n\n The following description is given in terms of individual points evaluated on\n a line function to be minimized. Note, however, the implementation also\n accepts batches of points allowing to minimize multiple line functions at\n once. 
See details on the docstring of `value_and_gradients_function` below.\n\n The requirement of the interval bracketing a minimum is expressed through the\n opposite slope conditions. Assume the left end point is 'a', the right\n end point is 'b', the function to be minimized is 'f' and the derivative is\n 'df'. The update procedure relies on the following conditions being satisfied:\n\n '''\n f(a) <= f(0) + epsilon (1)\n df(a) < 0 (2)\n df(b) > 0 (3)\n '''\n\n In the first condition, epsilon is a small positive constant. The condition\n demands that the function at the left end point be not much bigger than the\n starting point (i.e. 0). This is an easy to satisfy condition because by\n assumption, we are in a direction where the function value is decreasing.\n The second and third conditions together demand that there is at least one\n zero of the derivative in between a and b.\n\n In addition to the interval, the update algorithm requires a third point to\n be supplied. Usually, this point would lie within the interval [a, b]. If the\n point is outside this interval, the current interval is returned. If the\n point lies within the interval, the behaviour of the function and derivative\n value at this point is used to squeeze the original interval in a manner that\n preserves the opposite slope conditions.\n\n For further details of this component, see the procedure U0-U3 on page 123 of\n the [Hager and Zhang (2006)][2] article.\n\n Note that this function does not explicitly verify whether the opposite slope\n conditions are satisfied for the supplied interval. It is assumed that this\n is so.\n\n Args:\n value_and_gradients_function: A Python callable that accepts a real scalar\n tensor and returns an object that can be converted to a namedtuple.\n The namedtuple should have fields 'f' and 'df' that correspond to scalar\n tensors of real dtype containing the value of the function and its\n derivative at that point. 
The other namedtuple fields, if present,\n should be tensors or sequences (possibly nested) of tensors.\n In usual optimization application, this function would be generated by\n projecting the multivariate objective function along some specific\n direction. The direction is determined by some other procedure but should\n be a descent direction (i.e. the derivative of the projected univariate\n function must be negative at 0.).\n Alternatively, the function may represent the batching of `n` such line\n functions (e.g. projecting a single multivariate objective function along\n `n` distinct directions at once) accepting n points as input, i.e. a\n tensor of shape [n], and the fields 'f' and 'df' in the returned\n namedtuple should each be a tensor of shape [n], with the corresponding\n function values and derivatives at the input points.\n val_left: Return value of value_and_gradients_function at the left\n end point of the bracketing interval (labelled 'a' above).\n val_right: Return value of value_and_gradients_function at the right\n end point of the bracketing interval (labelled 'b' above).\n val_trial: Return value of value_and_gradients_function at the trial point\n to be used to shrink the interval (labelled 'c' above).\n f_lim: real `Tensor` of shape [n]. The function value threshold for\n the approximate Wolfe conditions to be checked for each batch member.\n active: optional boolean `Tensor` of shape [n]. Relevant in batching mode\n only, indicates batch members on which the update procedure should be\n applied. On non-active members the current left/right interval is returned\n unmodified.\n\n Returns:\n A namedtuple containing the following fields:\n iteration: An int32 scalar `Tensor`. The number of iterations performed\n by the bisect algorithm.\n stopped: A boolean `Tensor` of shape [n]. True for those batch members\n where the bisection algorithm terminated.\n failed: A boolean `Tensor` of shape [n]. 
True for those batch members\n where an error was encountered.\n num_evals: An int32 scalar `Tensor`. The number of times the objective\n function was evaluated.\n left: Return value of value_and_gradients_function at the updated left\n end point of the interval found.\n right: Return value of value_and_gradients_function at the updated right\n end point of the interval found.\n \"\"\"\n # We should only update if the trial point is within the interval.\n within_range = (val_left.x < val_trial.x) & (val_trial.x < val_right.x)\n if active is not None:\n within_range = within_range & active\n\n # The new point is a valid left end point if it has negative slope\n # and the value at the point is not too large.\n valid_left = (val_trial.df < 0) & (val_trial.f <= f_lim)\n\n # If the trial point has a negative slope but the value at that point\n # is too high, bisect can narrow down an interval between the current left\n # and the trial point.\n needs_bisect = within_range & (val_trial.df < 0) & (val_trial.f > f_lim)\n\n # Note that if `~valid_left` it is because either:\n # - the slope at the trial point is positive, so it is a valid right\n # point, or\n # - the needs_bisect condition is true.\n # In both cases we want to keep the current left and replace right\n # with the trial point.\n left = val_where(within_range & valid_left, val_trial, val_left)\n right = val_where(within_range & ~valid_left, val_trial, val_right)\n\n bisect_args = _IntermediateResult(\n iteration=tf.convert_to_tensor(value=0),\n stopped=~needs_bisect,\n failed=tf.zeros_like(within_range), # i.e. all false.\n num_evals=tf.convert_to_tensor(value=0),\n left=left,\n right=right)\n return _bisect(value_and_gradients_function, bisect_args, f_lim)"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\nbracketing the minimum given an initial starting point. Applies the Hager Zhang bracketing algorithm to find an interval containing a region with points satisfying Wolfe conditions. 
Uses the supplied initial step size 'c', the right end point of the provided search interval, to find such an interval. The only condition on 'c' is that it should be positive. For more details see steps B0-B3 in [Hager and Zhang (2006)][2]. Args: value_and_gradients_function: A Python callable that accepts a real scalar tensor and returns a namedtuple containing the value field `f` of the function and its derivative value field `df` at that point. Alternatively, the function may represent the batching of `n` such line functions (e.g. projecting a single multivariate objective function along `n` distinct directions at once) accepting n points as input, i.e. a tensor of shape [n], and return a tuple of two tensors of shape [n], the function values and the corresponding derivatives at the input points. search_interval: A namedtuple describing the current search interval, must include the fields: - converged: Boolean `Tensor` of shape [n], indicating batch members where search has already converged. Interval for these batch members won't be modified. - failed: Boolean `Tensor` of shape [n], indicating batch members where search has already failed. Interval for these batch members won't be modified. - iterations: Scalar int32 `Tensor`. Number of line search iterations so far. - func_evals: Scalar int32 `Tensor`. Number of function evaluations so far. - left: A namedtuple, as returned by value_and_gradients_function evaluated at 0, the left end point of the current interval. - right: A namedtuple, as returned by value_and_gradients_function, of the right end point of the current interval (labelled 'c' above). f_lim: real `Tensor` of shape [n]. The function value threshold for the approximate Wolfe conditions to be checked for each batch member. max_iterations: Int32 scalar `Tensor`. The maximum number of iterations permitted. The limit applies equally to all batch members. expansion_param: Scalar positive `Tensor` of real dtype. Must be greater than `1.`. 
Used to expand the initial interval in case it does not bracket a minimum. Returns: A namedtuple with the following fields. iteration: An int32 scalar `Tensor`. The number of iterations performed. Bounded above by `max_iterations` parameter. stopped: A boolean `Tensor` of shape [n]. True for those batch members where the algorithm terminated before reaching `max_iterations`. failed: A boolean `Tensor` of shape [n]. True for those batch members where an error was encountered during bracketing. num_evals: An int32 scalar `Tensor`. The number of times the objective function was evaluated. left: Return value of value_and_gradients_function at the updated left end point of the interval found. right: Return value of value_and_gradients_function at the updated right end point of the interval found.", "response": "def bracket(value_and_gradients_function,\n search_interval,\n f_lim,\n max_iterations,\n expansion_param=5.0):\n \"\"\"Brackets the minimum given an initial starting point.\n\n Applies the Hager Zhang bracketing algorithm to find an interval containing\n a region with points satisfying Wolfe conditions. Uses the supplied initial\n step size 'c', the right end point of the provided search interval, to find\n such an interval. The only condition on 'c' is that it should be positive.\n For more details see steps B0-B3 in [Hager and Zhang (2006)][2].\n\n Args:\n value_and_gradients_function: A Python callable that accepts a real scalar\n tensor and returns a namedtuple containing the value field `f` of the\n function and its derivative value field `df` at that point.\n Alternatively, the function may represent the batching of `n` such line\n functions (e.g. projecting a single multivariate objective function along\n `n` distinct directions at once) accepting n points as input, i.e. 
a\n tensor of shape [n], and return a tuple of two tensors of shape [n], the\n function values and the corresponding derivatives at the input points.\n search_interval: A namedtuple describing the current search interval,\n must include the fields:\n - converged: Boolean `Tensor` of shape [n], indicating batch members\n where search has already converged. Interval for these batch members\n won't be modified.\n - failed: Boolean `Tensor` of shape [n], indicating batch members\n where search has already failed. Interval for these batch members\n won't be modified.\n - iterations: Scalar int32 `Tensor`. Number of line search iterations\n so far.\n - func_evals: Scalar int32 `Tensor`. Number of function evaluations\n so far.\n - left: A namedtuple, as returned by value_and_gradients_function\n evaluated at 0, the left end point of the current interval.\n - right: A namedtuple, as returned by value_and_gradients_function,\n of the right end point of the current interval (labelled 'c' above).\n f_lim: real `Tensor` of shape [n]. The function value threshold for\n the approximate Wolfe conditions to be checked for each batch member.\n max_iterations: Int32 scalar `Tensor`. The maximum number of iterations\n permitted. The limit applies equally to all batch members.\n expansion_param: Scalar positive `Tensor` of real dtype. Must be greater\n than `1.`. Used to expand the initial interval in case it does not bracket\n a minimum.\n\n Returns:\n A namedtuple with the following fields.\n iteration: An int32 scalar `Tensor`. The number of iterations performed.\n Bounded above by `max_iterations` parameter.\n stopped: A boolean `Tensor` of shape [n]. True for those batch members\n where the algorithm terminated before reaching `max_iterations`.\n failed: A boolean `Tensor` of shape [n]. True for those batch members\n where an error was encountered during bracketing.\n num_evals: An int32 scalar `Tensor`. 
The number of times the objective\n function was evaluated.\n left: Return value of value_and_gradients_function at the updated left\n end point of the interval found.\n right: Return value of value_and_gradients_function at the updated right\n end point of the interval found.\n \"\"\"\n already_stopped = search_interval.failed | search_interval.converged\n\n # If the slope at right end point is positive, step B1 in [2], then the given\n # initial points already bracket a minimum.\n bracketed = search_interval.right.df >= 0\n\n # Bisection is needed, step B2, if right end point almost works as a new left\n # end point but the objective value is too high.\n needs_bisect = (\n search_interval.right.df < 0) & (search_interval.right.f > f_lim)\n\n # In these three cases bracketing is already `stopped` and there is no need\n # to perform further evaluations. Otherwise the bracketing loop is needed to\n # expand the interval, step B3, until the conditions are met.\n initial_args = _IntermediateResult(\n iteration=search_interval.iterations,\n stopped=already_stopped | bracketed | needs_bisect,\n failed=search_interval.failed,\n num_evals=search_interval.func_evals,\n left=search_interval.left,\n right=search_interval.right)\n\n def _loop_cond(curr):\n return (curr.iteration <\n max_iterations) & ~tf.reduce_all(input_tensor=curr.stopped)\n\n def _loop_body(curr):\n \"\"\"Main body of bracketing loop.\"\"\"\n # The loop maintains the invariant that curr.stopped is true if we have\n # either: failed, successfully bracketed, or not yet bracketed but needs\n # bisect. In the only remaining case, step B3 in [2], we need to\n # expand and update the left/right values appropriately.\n new_right = value_and_gradients_function(expansion_param * curr.right.x)\n left = val_where(curr.stopped, curr.left, curr.right)\n right = val_where(curr.stopped, curr.right, new_right)\n\n # Update the failed, bracketed, and needs_bisect conditions.\n failed = curr.failed | ~is_finite(right)\n bracketed = right.df >= 0\n needs_bisect = (right.df < 0) & (right.f > f_lim)\n return [_IntermediateResult(\n iteration=curr.iteration + 1,\n stopped=curr.stopped | failed | bracketed | needs_bisect,\n failed=failed,\n num_evals=curr.num_evals + 1,\n left=left,\n right=right)]\n\n bracket_result = tf.while_loop(\n cond=_loop_cond, body=_loop_body, loop_vars=[initial_args])[0]\n\n # For entries where bisect is still needed, mark them as not yet stopped,\n # reset the left end point, and run `_bisect` on them.\n needs_bisect = (\n (bracket_result.right.df < 0) & (bracket_result.right.f > f_lim))\n stopped = already_stopped | bracket_result.failed | ~needs_bisect\n left = val_where(stopped, bracket_result.left, search_interval.left)\n bisect_args = bracket_result._replace(stopped=stopped, left=left)\n return _bisect(value_and_gradients_function, bisect_args, f_lim)"} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\nbisect an interval and updates to satisfy opposite slope conditions.", "response": "def bisect(value_and_gradients_function,\n initial_left,\n initial_right,\n f_lim):\n \"\"\"Bisects an interval and updates to satisfy opposite slope conditions.\n\n Corresponds to the step U3 in [Hager and Zhang (2006)][2].\n\n Args:\n value_and_gradients_function: A Python callable that accepts a real scalar\n tensor and returns a namedtuple containing the value field `f` of the\n function and its derivative value field `df` at that point.\n Alternatively, the function may represent the batching of `n` such line\n functions (e.g. 
projecting a single multivariate objective function along\n `n` distinct directions at once) accepting n points as input, i.e. a\n tensor of shape [n], and return a tuple of two tensors of shape [n], the\n function values and the corresponding derivatives at the input points.\n initial_left: Return value of value_and_gradients_function at the left end\n point of the current bracketing interval.\n initial_right: Return value of value_and_gradients_function at the right end\n point of the current bracketing interval.\n f_lim: real `Tensor` of shape [n]. The function value threshold for\n the approximate Wolfe conditions to be checked for each batch member.\n\n Returns:\n A namedtuple containing the following fields:\n iteration: An int32 scalar `Tensor`. The number of iterations performed.\n Bounded above by `max_iterations` parameter.\n stopped: A boolean scalar `Tensor`. True if the bisect algorithm\n terminated.\n failed: A scalar boolean tensor. Indicates whether the objective function\n failed to produce a finite value.\n num_evals: A scalar int32 tensor. 
The number of value and gradients\n function evaluations.\n left: Return value of value_and_gradients_function at the left end\n point of the bracketing interval found.\n right: Return value of value_and_gradients_function at the right end\n point of the bracketing interval found.\n \"\"\"\n failed = ~is_finite(initial_left, initial_right)\n needs_bisect = (initial_right.df < 0) & (initial_right.f > f_lim)\n bisect_args = _IntermediateResult(\n iteration=tf.convert_to_tensor(value=0),\n stopped=failed | ~needs_bisect,\n failed=failed,\n num_evals=tf.convert_to_tensor(value=0),\n left=initial_left,\n right=initial_right)\n return _bisect(value_and_gradients_function, bisect_args, f_lim)"} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\nchecking if the supplied values are finite.", "response": "def is_finite(val_1, val_2=None):\n \"\"\"Checks if the supplied values are finite.\n\n Args:\n val_1: A namedtuple instance with the function value and derivative,\n as returned e.g. by value_and_gradients_function evaluations.\n val_2: (Optional) A namedtuple instance with the function value and\n derivative, as returned e.g. 
by value_and_gradients_function evaluations.\n\n Returns:\n is_finite: Scalar boolean `Tensor` indicating whether the function value\n and the derivative in `val_1` (and optionally in `val_2`) are all finite.\n \"\"\"\n val_1_finite = tf.math.is_finite(val_1.f) & tf.math.is_finite(val_1.df)\n if val_2 is not None:\n return val_1_finite & tf.math.is_finite(val_2.f) & tf.math.is_finite(\n val_2.df)\n return val_1_finite"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\nchecking whether the Wolfe or approx Wolfe conditions are satisfied.", "response": "def _satisfies_wolfe(val_0,\n val_c,\n f_lim,\n sufficient_decrease_param,\n curvature_param):\n \"\"\"Checks whether the Wolfe or approx Wolfe conditions are satisfied.\n\n The Wolfe conditions are a set of stopping criteria for an inexact line search\n algorithm. Let f(a) be the function value along the search direction and\n df(a) the derivative along the search direction evaluated a distance 'a'.\n Here 'a' is the distance along the search direction. The Wolfe conditions are:\n\n ```None\n f(a) <= f(0) + delta * a * df(0) (Armijo/Sufficient decrease condition)\n df(a) >= sigma * df(0) (Weak curvature condition)\n ```\n `delta` and `sigma` are two user supplied parameters satisfying:\n `0 < delta < sigma <= 1.`. In the following, delta is called\n `sufficient_decrease_param` and sigma is called `curvature_param`.\n\n On a finite precision machine, the Wolfe conditions are difficult to satisfy\n when one is close to the minimum. Hence, Hager-Zhang propose replacing\n the sufficient decrease condition with the following condition on the\n derivative in the vicinity of a minimum.\n\n ```None\n df(a) <= (2 * delta - 1) * df(0) (Approx Wolfe sufficient decrease)\n ```\n This condition is only used if one is near the minimum. 
This is tested using\n\n ```None\n f(a) <= f(0) + epsilon * |f(0)|\n ```\n The following function checks both the Wolfe and approx Wolfe conditions.\n Here, `epsilon` is a small positive constant. In the following, the argument\n `f_lim` corresponds to the product: epsilon * |f(0)|.\n\n Args:\n val_0: A namedtuple, as returned by value_and_gradients_function\n evaluated at 0.\n val_c: A namedtuple, as returned by value_and_gradients_function\n evaluated at the point to be tested.\n f_lim: Scalar `Tensor` of real dtype. The function value threshold for\n the approximate Wolfe conditions to be checked.\n sufficient_decrease_param: Positive scalar `Tensor` of real dtype.\n Bounded above by the curvature param. Corresponds to 'delta' in the\n terminology of [Hager and Zhang (2006)][2].\n curvature_param: Positive scalar `Tensor` of real dtype. Bounded above\n by `1.`. Corresponds to 'sigma' in the terminology of\n [Hager Zhang (2005)][1].\n\n Returns:\n is_satisfied: A scalar boolean `Tensor` which is True if either the\n Wolfe or approximate Wolfe conditions are satisfied.\n \"\"\"\n exact_wolfe_suff_dec = (sufficient_decrease_param * val_0.df >=\n (val_c.f - val_0.f) / val_c.x)\n wolfe_curvature = val_c.df >= curvature_param * val_0.df\n exact_wolfe = exact_wolfe_suff_dec & wolfe_curvature\n approx_wolfe_applies = val_c.f <= f_lim\n approx_wolfe_suff_dec = ((2 * sufficient_decrease_param - 1) * val_0.df\n >= val_c.df)\n approx_wolfe = approx_wolfe_applies & approx_wolfe_suff_dec & wolfe_curvature\n is_satisfied = exact_wolfe | approx_wolfe\n return is_satisfied"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _secant(val_a, val_b):\n return (val_a.x * val_b.df - val_b.x * val_a.df) / (val_b.df - val_a.df)", "response": "Calculates the secant interpolation for the minimum."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef 
make_simple_step_size_update_policy(num_adaptation_steps,\n target_rate=0.75,\n decrement_multiplier=0.01,\n increment_multiplier=0.01,\n step_counter=None):\n \"\"\"Create a function implementing a step-size update policy.\n\n The simple policy increases or decreases the `step_size_var` based on the\n average of `exp(minimum(0., log_accept_ratio))`. It is based on\n [Section 4.2 of Andrieu and Thoms (2008)](\n https://people.eecs.berkeley.edu/~jordan/sail/readings/andrieu-thoms.pdf).\n\n The `num_adaptation_steps` argument is set independently of any burnin\n for the overall chain. In general, adaptation prevents the chain from\n reaching a stationary distribution, so obtaining consistent samples requires\n `num_adaptation_steps` be set to a value [somewhat smaller](\n http://andrewgelman.com/2017/12/15/burn-vs-warm-iterative-simulation-algorithms/#comment-627745)\n than the number of burnin steps. However, it may sometimes be helpful to set\n `num_adaptation_steps` to a larger value during development in order to\n inspect the behavior of the chain during adaptation.\n\n Args:\n num_adaptation_steps: Scalar `int` `Tensor` number of initial steps\n during which to adjust the step size. This may be greater, less than, or\n equal to the number of burnin steps. If `None`, the step size is adapted\n on every step (note this breaks stationarity of the chain!).\n target_rate: Scalar `Tensor` representing desired `accept_ratio`.\n Default value: `0.75` (i.e., [center of asymptotically optimal\n rate](https://arxiv.org/abs/1411.6669)).\n decrement_multiplier: `Tensor` representing amount to downscale current\n `step_size`.\n Default value: `0.01`.\n increment_multiplier: `Tensor` representing amount to upscale current\n `step_size`.\n Default value: `0.01`.\n step_counter: Scalar `int` `Variable` specifying the current step. 
The step\n size is adapted iff `step_counter < num_adaptation_steps`.\n Default value: if `None`, an internal variable\n `step_size_adaptation_step_counter` is created and initialized to `-1`.\n\n Returns:\n step_size_simple_update_fn: Callable that takes args\n `step_size_var, kernel_results` and returns updated step size(s).\n \"\"\"\n if step_counter is None and num_adaptation_steps is not None:\n step_counter = tf.compat.v1.get_variable(\n name='step_size_adaptation_step_counter',\n initializer=np.array(-1, dtype=np.int32),\n # Specify the dtype for variable sharing to work correctly\n # (b/120599991).\n dtype=tf.int32,\n trainable=False,\n use_resource=True)\n\n def step_size_simple_update_fn(step_size_var, kernel_results):\n \"\"\"Updates (list of) `step_size` using a standard adaptive MCMC procedure.\n\n Args:\n step_size_var: (List of) `tf.Variable`s representing the per `state_part`\n HMC `step_size`.\n kernel_results: `collections.namedtuple` containing `Tensor`s\n representing values from most recent call to `one_step`.\n\n Returns:\n step_size_assign: (List of) `Tensor`(s) representing updated\n `step_size_var`(s).\n \"\"\"\n\n if kernel_results is None:\n if mcmc_util.is_list_like(step_size_var):\n return [tf.identity(ss) for ss in step_size_var]\n return tf.identity(step_size_var)\n log_n = tf.math.log(\n tf.cast(\n tf.size(input=kernel_results.log_accept_ratio),\n kernel_results.log_accept_ratio.dtype))\n log_mean_accept_ratio = tf.reduce_logsumexp(\n input_tensor=tf.minimum(kernel_results.log_accept_ratio, 0.)) - log_n\n adjustment = tf.where(\n log_mean_accept_ratio < tf.cast(\n tf.math.log(target_rate), log_mean_accept_ratio.dtype),\n -decrement_multiplier / (1. 
+ decrement_multiplier),\n increment_multiplier)\n\n def build_assign_op():\n if mcmc_util.is_list_like(step_size_var):\n return [\n ss.assign_add(ss * tf.cast(adjustment, ss.dtype))\n for ss in step_size_var\n ]\n return step_size_var.assign_add(\n step_size_var * tf.cast(adjustment, step_size_var.dtype))\n\n if num_adaptation_steps is None:\n return build_assign_op()\n else:\n with tf.control_dependencies([step_counter.assign_add(1)]):\n return tf.cond(\n pred=step_counter < num_adaptation_steps,\n true_fn=build_assign_op,\n false_fn=lambda: step_size_var)\n\n return step_size_simple_update_fn", "response": "Creates a function implementing a simple step - size update policy."} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\napplies num_leapfrog_steps of the leapfrog integrator.", "response": "def _leapfrog_integrator_one_step(\n target_log_prob_fn,\n independent_chain_ndims,\n step_sizes,\n current_momentum_parts,\n current_state_parts,\n current_target_log_prob,\n current_target_log_prob_grad_parts,\n state_gradients_are_stopped=False,\n name=None):\n \"\"\"Applies `num_leapfrog_steps` of the leapfrog integrator.\n\n Assumes a simple quadratic kinetic energy function: `0.5 ||momentum||**2`.\n\n #### Examples:\n\n ##### Simple quadratic potential.\n\n ```python\n import matplotlib.pyplot as plt\n %matplotlib inline\n import numpy as np\n import tensorflow as tf\n from tensorflow_probability.python.mcmc.hmc import _leapfrog_integrator_one_step # pylint: disable=line-too-long\n tfd = tfp.distributions\n\n dims = 10\n num_iter = int(1e3)\n dtype = np.float32\n\n position = tf.placeholder(np.float32)\n momentum = tf.placeholder(np.float32)\n\n target_log_prob_fn = tfd.MultivariateNormalDiag(\n loc=tf.zeros(dims, dtype)).log_prob\n\n def _leapfrog_one_step(*args):\n # Closure representing computation done during each leapfrog step.\n return _leapfrog_integrator_one_step(\n target_log_prob_fn=target_log_prob_fn,\n 
independent_chain_ndims=0,\n step_sizes=[0.1],\n current_momentum_parts=args[0],\n current_state_parts=args[1],\n current_target_log_prob=args[2],\n current_target_log_prob_grad_parts=args[3])\n\n # Do leapfrog integration.\n [\n [next_momentum],\n [next_position],\n next_target_log_prob,\n next_target_log_prob_grad_parts,\n ] = tf.while_loop(\n cond=lambda *args: True,\n body=_leapfrog_one_step,\n loop_vars=[\n [momentum],\n [position],\n target_log_prob_fn(position),\n tf.gradients(target_log_prob_fn(position), position),\n ],\n maximum_iterations=3)\n\n momentum_ = np.random.randn(dims).astype(dtype)\n position_ = np.random.randn(dims).astype(dtype)\n positions = np.zeros([num_iter, dims], dtype)\n\n with tf.Session() as sess:\n for i in xrange(num_iter):\n position_, momentum_ = sess.run(\n [next_momentum, next_position],\n feed_dict={position: position_, momentum: momentum_})\n positions[i] = position_\n\n plt.plot(positions[:, 0]); # Sinusoidal.\n ```\n\n Args:\n target_log_prob_fn: Python callable which takes an argument like\n `*current_state_parts` and returns its (possibly unnormalized) log-density\n under the target distribution.\n independent_chain_ndims: Scalar `int` `Tensor` representing the number of\n leftmost `Tensor` dimensions which index independent chains.\n step_sizes: Python `list` of `Tensor`s representing the step size for the\n leapfrog integrator. Must broadcast with the shape of\n `current_state_parts`. Larger step sizes lead to faster progress, but\n too-large step sizes make rejection exponentially more likely. When\n possible, it's often helpful to match per-variable step sizes to the\n standard deviations of the target distribution in each variable.\n current_momentum_parts: Tensor containing the value(s) of the momentum\n variable(s) to update.\n current_state_parts: Python `list` of `Tensor`s representing the current\n state(s) of the Markov chain(s). 
The first `independent_chain_ndims` of\n the `Tensor`(s) index different chains.\n current_target_log_prob: `Tensor` representing the value of\n `target_log_prob_fn(*current_state_parts)`. The only reason to specify\n this argument is to reduce TF graph size.\n current_target_log_prob_grad_parts: Python list of `Tensor`s representing\n gradient of `target_log_prob_fn(*current_state_parts`) wrt\n `current_state_parts`. Must have same shape as `current_state_parts`. The\n only reason to specify this argument is to reduce TF graph size.\n state_gradients_are_stopped: Python `bool` indicating that the proposed new\n state be run through `tf.stop_gradient`. This is particularly useful when\n combining optimization over samples from the HMC chain.\n Default value: `False` (i.e., do not apply `stop_gradient`).\n name: Python `str` name prefixed to Ops created by this function.\n Default value: `None` (i.e., 'hmc_leapfrog_integrator').\n\n Returns:\n proposed_momentum_parts: Updated value of the momentum.\n proposed_state_parts: Tensor or Python list of `Tensor`s representing the\n state(s) of the Markov chain(s) at each result step. Has same shape as\n input `current_state_parts`.\n proposed_target_log_prob: `Tensor` representing the value of\n `target_log_prob_fn` at `next_state`.\n proposed_target_log_prob_grad_parts: Gradient of `proposed_target_log_prob`\n wrt `next_state`.\n\n Raises:\n ValueError: if `len(momentum_parts) != len(state_parts)`.\n ValueError: if `len(state_parts) != len(step_sizes)`.\n ValueError: if `len(state_parts) != len(grads_target_log_prob)`.\n TypeError: if `not target_log_prob.dtype.is_floating`.\n \"\"\"\n # Note on per-variable step sizes:\n #\n # Using per-variable step sizes is equivalent to using the same step\n # size for all variables and adding a diagonal mass matrix in the\n # kinetic energy term of the Hamiltonian being integrated. 
This is\n # hinted at by Neal (2011) but not derived in detail there.\n #\n # Let x and v be position and momentum variables respectively.\n # Let g(x) be the gradient of `target_log_prob_fn(x)`.\n # Let S be a diagonal matrix of per-variable step sizes.\n # Let the Hamiltonian H(x, v) = -target_log_prob_fn(x) + 0.5 * ||v||**2.\n #\n # Using per-variable step sizes gives the updates\n # v' = v + 0.5 * matmul(S, g(x))\n # x'' = x + matmul(S, v')\n # v'' = v' + 0.5 * matmul(S, g(x''))\n #\n # Let u = matmul(inv(S), v).\n # Multiplying v by inv(S) in the updates above gives the transformed dynamics\n # u' = matmul(inv(S), v') = matmul(inv(S), v) + 0.5 * g(x)\n # = u + 0.5 * g(x)\n # x'' = x + matmul(S, v') = x + matmul(S**2, u')\n # u'' = matmul(inv(S), v'') = matmul(inv(S), v') + 0.5 * g(x'')\n # = u' + 0.5 * g(x'')\n #\n # These are exactly the leapfrog updates for the Hamiltonian\n # H'(x, u) = -target_log_prob_fn(x) + 0.5 * u^T S**2 u\n # = -target_log_prob_fn(x) + 0.5 * ||v||**2 = H(x, v).\n #\n # To summarize:\n #\n # * Using per-variable step sizes implicitly simulates the dynamics\n # of the Hamiltonian H' (which are energy-conserving in H'). 
We\n # keep track of v instead of u, but the underlying dynamics are\n # the same if we transform back.\n # * The value of the Hamiltonian H'(x, u) is the same as the value\n # of the original Hamiltonian H(x, v) after we transform back from\n # u to v.\n # * Sampling v ~ N(0, I) is equivalent to sampling u ~ N(0, S**-2).\n #\n # So using per-variable step sizes in HMC will give results that are\n # exactly identical to explicitly using a diagonal mass matrix.\n\n with tf.compat.v1.name_scope(name, 'hmc_leapfrog_integrator_one_step', [\n independent_chain_ndims, step_sizes, current_momentum_parts,\n current_state_parts, current_target_log_prob,\n current_target_log_prob_grad_parts\n ]):\n\n # Step 1: Update momentum.\n proposed_momentum_parts = [\n v + 0.5 * tf.cast(eps, v.dtype) * g\n for v, eps, g\n in zip(current_momentum_parts,\n step_sizes,\n current_target_log_prob_grad_parts)]\n\n # Step 2: Update state.\n proposed_state_parts = [\n x + tf.cast(eps, v.dtype) * v\n for x, eps, v\n in zip(current_state_parts,\n step_sizes,\n proposed_momentum_parts)]\n\n if state_gradients_are_stopped:\n proposed_state_parts = [tf.stop_gradient(x) for x in proposed_state_parts]\n\n # Step 3a: Re-evaluate target-log-prob (and grad) at proposed state.\n [\n proposed_target_log_prob,\n proposed_target_log_prob_grad_parts,\n ] = mcmc_util.maybe_call_fn_and_grads(\n target_log_prob_fn,\n proposed_state_parts)\n\n if not proposed_target_log_prob.dtype.is_floating:\n raise TypeError('`target_log_prob_fn` must produce a `Tensor` '\n 'with `float` `dtype`.')\n\n if any(g is None for g in proposed_target_log_prob_grad_parts):\n raise ValueError(\n 'Encountered `None` gradient. 
Does your target `target_log_prob_fn` '\n 'access all `tf.Variable`s via `tf.get_variable`?\\n'\n ' current_state_parts: {}\\n'\n ' proposed_state_parts: {}\\n'\n ' proposed_target_log_prob_grad_parts: {}'.format(\n current_state_parts,\n proposed_state_parts,\n proposed_target_log_prob_grad_parts))\n\n # Step 3b: Update momentum (again).\n proposed_momentum_parts = [\n v + 0.5 * tf.cast(eps, v.dtype) * g\n for v, eps, g\n in zip(proposed_momentum_parts,\n step_sizes,\n proposed_target_log_prob_grad_parts)]\n\n return [\n proposed_momentum_parts,\n proposed_state_parts,\n proposed_target_log_prob,\n proposed_target_log_prob_grad_parts,\n ]"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef _compute_log_acceptance_correction(current_momentums,\n proposed_momentums,\n independent_chain_ndims,\n name=None):\n \"\"\"Helper to `kernel` which computes the log acceptance-correction.\n\n A sufficient but not necessary condition for the existence of a stationary\n distribution, `p(x)`, is \"detailed balance\", i.e.:\n\n ```none\n p(x'|x) p(x) = p(x|x') p(x')\n ```\n\n In the Metropolis-Hastings algorithm, a state is proposed according to\n `g(x'|x)` and accepted according to `a(x'|x)`, hence\n `p(x'|x) = g(x'|x) a(x'|x)`.\n\n Inserting this into the detailed balance equation implies:\n\n ```none\n g(x'|x) a(x'|x) p(x) = g(x|x') a(x|x') p(x')\n ==> a(x'|x) / a(x|x') = p(x') / p(x) [g(x|x') / g(x'|x)] (*)\n ```\n\n One definition of `a(x'|x)` which satisfies (*) is:\n\n ```none\n a(x'|x) = min(1, p(x') / p(x) [g(x|x') / g(x'|x)])\n ```\n\n (To see that this satisfies (*), notice that under this definition only at\n most one `a(x'|x)` and `a(x|x') can be other than one.)\n\n We call the bracketed term the \"acceptance correction\".\n\n In the case of UncalibratedHMC, the log acceptance-correction is not the log\n proposal-ratio. 
UncalibratedHMC augments the state-space with momentum, z.\n Assuming a standard Gaussian distribution for momentums, the chain eventually\n converges to:\n\n ```none\n p([x, z]) propto= target_prob(x) exp(-0.5 z**2)\n ```\n\n Relating this back to Metropolis-Hastings parlance, for HMC we have:\n\n ```none\n p([x, z]) propto= target_prob(x) exp(-0.5 z**2)\n g([x, z] | [x', z']) = g([x', z'] | [x, z])\n ```\n\n In other words, the MH bracketed term is `1`. However, because we desire to\n use a general MH framework, we can place the momentum probability ratio inside\n the metropolis-correction factor thus getting an acceptance probability:\n\n ```none\n target_prob(x')\n accept_prob(x'|x) = ----------------- [exp(-0.5 z**2) / exp(-0.5 z'**2)]\n target_prob(x)\n ```\n\n (Note: we actually need to handle the kinetic energy change at each leapfrog\n step, but this is the idea.)\n\n Args:\n current_momentums: `Tensor` representing the value(s) of the current\n momentum(s) of the state (parts).\n proposed_momentums: `Tensor` representing the value(s) of the proposed\n momentum(s) of the state (parts).\n independent_chain_ndims: Scalar `int` `Tensor` representing the number of\n leftmost `Tensor` dimensions which index independent chains.\n name: Python `str` name prefixed to Ops created by this function.\n Default value: `None` (i.e., 'compute_log_acceptance_correction').\n\n Returns:\n log_acceptance_correction: `Tensor` representing the `log`\n acceptance-correction. 
(See docstring for mathematical definition.)\n \"\"\"\n with tf.compat.v1.name_scope(\n name, 'compute_log_acceptance_correction',\n [independent_chain_ndims, current_momentums, proposed_momentums]):\n log_current_kinetic, log_proposed_kinetic = [], []\n for current_momentum, proposed_momentum in zip(\n current_momentums, proposed_momentums):\n axis = tf.range(independent_chain_ndims, tf.rank(current_momentum))\n log_current_kinetic.append(_log_sum_sq(current_momentum, axis))\n log_proposed_kinetic.append(_log_sum_sq(proposed_momentum, axis))\n current_kinetic = 0.5 * tf.exp(\n tf.reduce_logsumexp(\n input_tensor=tf.stack(log_current_kinetic, axis=-1), axis=-1))\n proposed_kinetic = 0.5 * tf.exp(\n tf.reduce_logsumexp(\n input_tensor=tf.stack(log_proposed_kinetic, axis=-1), axis=-1))\n return mcmc_util.safe_sum([current_kinetic, -proposed_kinetic])", "response": "Helper function to compute the log acceptance correction."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef _log_sum_sq(x, axis=None):\n return tf.reduce_logsumexp(\n input_tensor=2. 
* tf.math.log(tf.abs(x)), axis=axis)", "response": "Computes the log of the sum of squares of x."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef one_step(self, current_state, previous_kernel_results):\n previous_step_size_assign = (\n [] if self.step_size_update_fn is None\n else (previous_kernel_results.extra.step_size_assign\n if mcmc_util.is_list_like(\n previous_kernel_results.extra.step_size_assign)\n else [previous_kernel_results.extra.step_size_assign]))\n\n with tf.control_dependencies(previous_step_size_assign):\n next_state, kernel_results = self._impl.one_step(\n current_state, previous_kernel_results)\n if self.step_size_update_fn is not None:\n step_size_assign = self.step_size_update_fn( # pylint: disable=not-callable\n self.step_size, kernel_results)\n kernel_results = kernel_results._replace(\n extra=HamiltonianMonteCarloExtraKernelResults(\n step_size_assign=step_size_assign))\n return next_state, kernel_results", "response": "Runs one iteration of Hamiltonian Monte Carlo."} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef bootstrap_results(self, init_state):\n kernel_results = self._impl.bootstrap_results(init_state)\n if self.step_size_update_fn is not None:\n step_size_assign = self.step_size_update_fn(self.step_size, None) # pylint: disable=not-callable\n kernel_results = kernel_results._replace(\n extra=HamiltonianMonteCarloExtraKernelResults(\n step_size_assign=step_size_assign))\n return kernel_results", "response": "Creates initial kernel_results using a supplied state."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef bayesian_resnet(input_shape,\n num_classes=10,\n kernel_posterior_scale_mean=-9.0,\n kernel_posterior_scale_stddev=0.1,\n kernel_posterior_scale_constraint=0.2):\n \"\"\"Constructs a ResNet18 model.\n\n Args:\n input_shape: A `tuple` indicating the Tensor shape.\n num_classes: `int` 
representing the number of class labels.\n kernel_posterior_scale_mean: Python `float` number for the kernel\n posterior's scale (log variance) mean. The smaller the mean, the closer\n the initialization is to a deterministic network.\n kernel_posterior_scale_stddev: Python `float` number for the initial kernel\n posterior's scale stddev.\n ```\n q(W|x) ~ N(mu, var),\n log_var ~ N(kernel_posterior_scale_mean, kernel_posterior_scale_stddev)\n ```\n kernel_posterior_scale_constraint: Python `float` number for the log value\n to constrain the log variance throughout training.\n i.e. log_var <= log(kernel_posterior_scale_constraint).\n\n Returns:\n tf.keras.Model.\n \"\"\"\n\n filters = [64, 128, 256, 512]\n kernels = [3, 3, 3, 3]\n strides = [1, 2, 2, 2]\n\n def _untransformed_scale_constraint(t):\n return tf.clip_by_value(t, -1000,\n tf.math.log(kernel_posterior_scale_constraint))\n\n kernel_posterior_fn = tfp.layers.default_mean_field_normal_fn(\n untransformed_scale_initializer=tf.compat.v1.initializers.random_normal(\n mean=kernel_posterior_scale_mean,\n stddev=kernel_posterior_scale_stddev),\n untransformed_scale_constraint=_untransformed_scale_constraint)\n\n image = tf.keras.layers.Input(shape=input_shape, dtype='float32')\n x = tfp.layers.Convolution2DFlipout(\n 64,\n 3,\n strides=1,\n padding='same',\n kernel_posterior_fn=kernel_posterior_fn)(image)\n\n for i in range(len(kernels)):\n x = _resnet_block(\n x,\n filters[i],\n kernels[i],\n strides[i],\n kernel_posterior_fn)\n\n x = tf.keras.layers.BatchNormalization()(x)\n x = tf.keras.layers.Activation('relu')(x)\n x = tf.keras.layers.AveragePooling2D(4, 1)(x)\n x = tf.keras.layers.Flatten()(x)\n\n x = tfp.layers.DenseFlipout(\n num_classes,\n kernel_posterior_fn=kernel_posterior_fn)(x)\n\n model = tf.keras.Model(inputs=image, outputs=x, name='resnet18')\n return model", "response": "Constructs a ResNet18 model."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the 
documentation\ndef _resnet_block(x, filters, kernel, stride, kernel_posterior_fn):\n x = tf.keras.layers.BatchNormalization()(x)\n x = tf.keras.layers.Activation('relu')(x)\n\n if stride != 1 or filters != x.shape[1]:\n shortcut = _projection_shortcut(x, filters, stride, kernel_posterior_fn)\n else:\n shortcut = x\n\n x = tfp.layers.Convolution2DFlipout(\n filters,\n kernel,\n strides=stride,\n padding='same',\n kernel_posterior_fn=kernel_posterior_fn)(x)\n x = tf.keras.layers.BatchNormalization()(x)\n x = tf.keras.layers.Activation('relu')(x)\n\n x = tfp.layers.Convolution2DFlipout(\n filters,\n kernel,\n strides=1,\n padding='same',\n kernel_posterior_fn=kernel_posterior_fn)(x)\n x = tf.keras.layers.add([x, shortcut])\n return x", "response": "Network block for ResNet."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef make_encoder(activation, num_topics, layer_sizes):\n encoder_net = tf.keras.Sequential()\n for num_hidden_units in layer_sizes:\n encoder_net.add(\n tf.keras.layers.Dense(\n num_hidden_units,\n activation=activation,\n kernel_initializer=tf.compat.v1.glorot_normal_initializer()))\n encoder_net.add(\n tf.keras.layers.Dense(\n num_topics,\n activation=tf.nn.softplus,\n kernel_initializer=tf.compat.v1.glorot_normal_initializer()))\n\n def encoder(bag_of_words):\n net = _clip_dirichlet_parameters(encoder_net(bag_of_words))\n return tfd.Dirichlet(concentration=net,\n name=\"topics_posterior\")\n\n return encoder", "response": "Create the encoder function."} {"SOURCE": "codesearchnet", "instruction": "Can you create a Python 3 function that\ncreates the decoder function.", "response": "def make_decoder(num_topics, num_words):\n \"\"\"Create the decoder function.\n\n Args:\n num_topics: The number of topics.\n num_words: The number of words.\n\n Returns:\n decoder: A `callable` mapping a `Tensor` of encodings to a\n `tfd.Distribution` instance over words.\n \"\"\"\n topics_words_logits 
= tf.compat.v1.get_variable(\n \"topics_words_logits\",\n shape=[num_topics, num_words],\n initializer=tf.compat.v1.glorot_normal_initializer())\n topics_words = tf.nn.softmax(topics_words_logits, axis=-1)\n\n def decoder(topics):\n word_probs = tf.matmul(topics, topics_words)\n # The observations are bag of words and therefore not one-hot. However,\n # log_prob of OneHotCategorical computes the probability correctly in\n # this case.\n return tfd.OneHotCategorical(probs=word_probs,\n name=\"bag_of_words\")\n\n return decoder, topics_words"} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\ncreates the prior distribution.", "response": "def make_prior(num_topics, initial_value):\n \"\"\"Create the prior distribution.\n\n Args:\n num_topics: Number of topics.\n initial_value: The starting value for the prior parameters.\n\n Returns:\n prior: A `callable` that returns a `tf.distribution.Distribution`\n instance, the prior distribution.\n prior_variables: A `list` of `Variable` objects, the trainable parameters\n of the prior.\n \"\"\"\n def _softplus_inverse(x):\n return np.log(np.expm1(x))\n\n logit_concentration = tf.compat.v1.get_variable(\n \"logit_concentration\",\n shape=[1, num_topics],\n initializer=tf.compat.v1.initializers.constant(\n _softplus_inverse(initial_value)))\n concentration = _clip_dirichlet_parameters(\n tf.nn.softplus(logit_concentration))\n\n def prior():\n return tfd.Dirichlet(concentration=concentration,\n name=\"topics_prior\")\n\n prior_variables = [logit_concentration]\n\n return prior, prior_variables"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef model_fn(features, labels, mode, params, config):\n del labels, config\n\n encoder = make_encoder(params[\"activation\"],\n params[\"num_topics\"],\n params[\"layer_sizes\"])\n decoder, topics_words = make_decoder(params[\"num_topics\"],\n features.shape[1])\n prior, 
prior_variables = make_prior(params[\"num_topics\"],\n params[\"prior_initial_value\"])\n\n topics_prior = prior()\n alpha = topics_prior.concentration\n\n topics_posterior = encoder(features)\n topics = topics_posterior.sample()\n random_reconstruction = decoder(topics)\n\n reconstruction = random_reconstruction.log_prob(features)\n tf.compat.v1.summary.scalar(\"reconstruction\",\n tf.reduce_mean(input_tensor=reconstruction))\n\n # Compute the KL-divergence between two Dirichlets analytically.\n # The sampled KL does not work well for \"sparse\" distributions\n # (see Appendix D of [2]).\n kl = tfd.kl_divergence(topics_posterior, topics_prior)\n tf.compat.v1.summary.scalar(\"kl\", tf.reduce_mean(input_tensor=kl))\n\n # Ensure that the KL is non-negative (up to a very small slack).\n # Negative KL can happen due to numerical instability.\n with tf.control_dependencies(\n [tf.compat.v1.assert_greater(kl, -1e-3, message=\"kl\")]):\n kl = tf.identity(kl)\n\n elbo = reconstruction - kl\n avg_elbo = tf.reduce_mean(input_tensor=elbo)\n tf.compat.v1.summary.scalar(\"elbo\", avg_elbo)\n loss = -avg_elbo\n\n # Perform variational inference by minimizing the -ELBO.\n global_step = tf.compat.v1.train.get_or_create_global_step()\n optimizer = tf.compat.v1.train.AdamOptimizer(params[\"learning_rate\"])\n\n # This implements the \"burn-in\" for prior parameters (see Appendix D of [2]).\n # For the first prior_burn_in_steps steps they are fixed, and then trained\n # jointly with the other parameters.\n grads_and_vars = optimizer.compute_gradients(loss)\n grads_and_vars_except_prior = [\n x for x in grads_and_vars if x[1] not in prior_variables]\n\n def train_op_except_prior():\n return optimizer.apply_gradients(\n grads_and_vars_except_prior,\n global_step=global_step)\n\n def train_op_all():\n return optimizer.apply_gradients(\n grads_and_vars,\n global_step=global_step)\n\n train_op = tf.cond(\n pred=global_step < params[\"prior_burn_in_steps\"],\n 
true_fn=train_op_except_prior,\n false_fn=train_op_all)\n\n # The perplexity is an exponent of the average negative ELBO per word.\n words_per_document = tf.reduce_sum(input_tensor=features, axis=1)\n log_perplexity = -elbo / words_per_document\n tf.compat.v1.summary.scalar(\n \"perplexity\", tf.exp(tf.reduce_mean(input_tensor=log_perplexity)))\n (log_perplexity_tensor,\n log_perplexity_update) = tf.compat.v1.metrics.mean(log_perplexity)\n perplexity_tensor = tf.exp(log_perplexity_tensor)\n\n # Obtain the topics summary. Implemented as a py_func for simplicity.\n topics = tf.compat.v1.py_func(\n functools.partial(get_topics_strings, vocabulary=params[\"vocabulary\"]),\n [topics_words, alpha],\n tf.string,\n stateful=False)\n tf.compat.v1.summary.text(\"topics\", topics)\n\n return tf.estimator.EstimatorSpec(\n mode=mode,\n loss=loss,\n train_op=train_op,\n eval_metric_ops={\n \"elbo\": tf.compat.v1.metrics.mean(elbo),\n \"reconstruction\": tf.compat.v1.metrics.mean(reconstruction),\n \"kl\": tf.compat.v1.metrics.mean(kl),\n \"perplexity\": (perplexity_tensor, log_perplexity_update),\n \"topics\": (topics, tf.no_op()),\n },\n )", "response": "Builds the model function for use in an estimator."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef sample_chain(\n num_results,\n current_state,\n previous_kernel_results=None,\n kernel=None,\n num_burnin_steps=0,\n num_steps_between_results=0,\n trace_fn=lambda current_state, kernel_results: kernel_results,\n return_final_kernel_results=False,\n parallel_iterations=10,\n name=None,\n):\n \"\"\"Implements Markov chain Monte Carlo via repeated `TransitionKernel` steps.\n\n This function samples from a Markov chain at `current_state` whose\n stationary distribution is governed by the supplied `TransitionKernel`\n instance (`kernel`).\n\n This function can sample from multiple chains, in parallel. 
(Whether or not\n there are multiple chains is dictated by the `kernel`.)\n\n The `current_state` can be represented as a single `Tensor` or a `list` of\n `Tensors` which collectively represent the current state.\n\n Since MCMC states are correlated, it is sometimes desirable to produce\n additional intermediate states, and then discard them, ending up with a set of\n states with decreased autocorrelation. See [Owen (2017)][1]. Such \"thinning\"\n is made possible by setting `num_steps_between_results > 0`. The chain then\n takes `num_steps_between_results` extra steps between the steps that make it\n into the results. The extra steps are never materialized (in calls to\n `sess.run`), and thus do not increase memory requirements.\n\n Warning: when setting a `seed` in the `kernel`, ensure that `sample_chain`'s\n `parallel_iterations=1`, otherwise results will not be reproducible.\n\n In addition to returning the chain state, this function supports tracing of\n auxiliary variables used by the kernel. The traced values are selected by\n specifying `trace_fn`. By default, all kernel results are traced but in the\n future the default will be changed to no results being traced, so plan\n accordingly. See below for some examples of this feature.\n\n Args:\n num_results: Integer number of Markov chain draws.\n current_state: `Tensor` or Python `list` of `Tensor`s representing the\n current state(s) of the Markov chain(s).\n previous_kernel_results: A `Tensor` or a nested collection of `Tensor`s\n representing internal calculations made within the previous call to this\n function (or as returned by `bootstrap_results`).\n kernel: An instance of `tfp.mcmc.TransitionKernel` which implements one step\n of the Markov chain.\n num_burnin_steps: Integer number of chain steps to take before starting to\n collect results.\n Default value: 0 (i.e., no burn-in).\n num_steps_between_results: Integer number of chain steps between collecting\n a result. 
Only one out of every `num_steps_between_results + 1` steps is\n included in the returned results. The number of returned chain states is\n still equal to `num_results`. Default value: 0 (i.e., no thinning).\n trace_fn: A callable that takes in the current chain state and the previous\n kernel results and returns a `Tensor` or a nested collection of `Tensor`s\n that is then traced along with the chain state.\n return_final_kernel_results: If `True`, then the final kernel results are\n returned alongside the chain state and the trace specified by the\n `trace_fn`.\n parallel_iterations: The number of iterations allowed to run in parallel. It\n must be a positive integer. See `tf.while_loop` for more details.\n name: Python `str` name prefixed to Ops created by this function.\n Default value: `None` (i.e., \"mcmc_sample_chain\").\n\n Returns:\n checkpointable_states_and_trace: if `return_final_kernel_results` is\n `True`. The return value is an instance of\n `CheckpointableStatesAndTrace`.\n all_states: if `return_final_kernel_results` is `False` and `trace_fn` is\n `None`. The return value is a `Tensor` or Python list of `Tensor`s\n representing the state(s) of the Markov chain(s) at each result step. Has\n same shape as input `current_state` but with a prepended\n `num_results`-size dimension.\n states_and_trace: if `return_final_kernel_results` is `False` and\n `trace_fn` is not `None`. 
The return value is an instance of\n `StatesAndTrace`.\n\n #### Examples\n\n ##### Sample from a diagonal-variance Gaussian.\n\n I.e.,\n\n ```none\n for i=1..n:\n x[i] ~ MultivariateNormal(loc=0, scale=diag(true_stddev)) # likelihood\n ```\n\n ```python\n import tensorflow as tf\n import tensorflow_probability as tfp\n tfd = tfp.distributions\n\n dims = 10\n true_stddev = np.sqrt(np.linspace(1., 3., dims))\n likelihood = tfd.MultivariateNormalDiag(loc=0., scale_diag=true_stddev)\n\n states = tfp.mcmc.sample_chain(\n num_results=1000,\n num_burnin_steps=500,\n current_state=tf.zeros(dims),\n kernel=tfp.mcmc.HamiltonianMonteCarlo(\n target_log_prob_fn=likelihood.log_prob,\n step_size=0.5,\n num_leapfrog_steps=2),\n trace_fn=None)\n\n sample_mean = tf.reduce_mean(states, axis=0)\n # ==> approx all zeros\n\n sample_stddev = tf.sqrt(tf.reduce_mean(\n tf.squared_difference(states, sample_mean),\n axis=0))\n # ==> approx equal true_stddev\n ```\n\n ##### Sampling from factor-analysis posteriors with known factors.\n\n I.e.,\n\n ```none\n # prior\n w ~ MultivariateNormal(loc=0, scale=eye(d))\n for i=1..n:\n # likelihood\n x[i] ~ Normal(loc=w^T F[i], scale=1)\n ```\n\n where `F` denotes factors.\n\n ```python\n import tensorflow as tf\n import tensorflow_probability as tfp\n tfd = tfp.distributions\n\n # Specify model.\n def make_prior(dims):\n return tfd.MultivariateNormalDiag(\n loc=tf.zeros(dims))\n\n def make_likelihood(weights, factors):\n return tfd.MultivariateNormalDiag(\n loc=tf.matmul(weights, factors, adjoint_b=True))\n\n def joint_log_prob(num_weights, factors, x, w):\n return (make_prior(num_weights).log_prob(w) +\n make_likelihood(w, factors).log_prob(x))\n\n def unnormalized_log_posterior(w):\n # Posterior is proportional to: `p(W, X=x | factors)`.\n return joint_log_prob(num_weights, factors, x, w)\n\n # Setup data.\n num_weights = 10 # == d\n num_factors = 40 # == n\n num_chains = 100\n\n weights = make_prior(num_weights).sample(1)\n factors = 
tf.random_normal([num_factors, num_weights])\n x = make_likelihood(weights, factors).sample()\n\n # Sample from Hamiltonian Monte Carlo Markov Chain.\n\n # Get `num_results` samples from `num_chains` independent chains.\n chains_states, kernels_results = tfp.mcmc.sample_chain(\n num_results=1000,\n num_burnin_steps=500,\n current_state=tf.zeros([num_chains, num_weights], name='init_weights'),\n kernel=tfp.mcmc.HamiltonianMonteCarlo(\n target_log_prob_fn=unnormalized_log_posterior,\n step_size=0.1,\n num_leapfrog_steps=2))\n\n # Compute sample stats.\n sample_mean = tf.reduce_mean(chains_states, axis=[0, 1])\n # ==> approx equal to weights\n\n sample_var = tf.reduce_mean(\n tf.squared_difference(chains_states, sample_mean),\n axis=[0, 1])\n # ==> less than 1\n ```\n\n ##### Custom tracing functions.\n\n ```python\n import tensorflow as tf\n import tensorflow_probability as tfp\n tfd = tfp.distributions\n\n likelihood = tfd.Normal(loc=0., scale=1.)\n\n def sample_chain(trace_fn):\n return tfp.mcmc.sample_chain(\n num_results=1000,\n num_burnin_steps=500,\n current_state=0.,\n kernel=tfp.mcmc.HamiltonianMonteCarlo(\n target_log_prob_fn=likelihood.log_prob,\n step_size=0.5,\n num_leapfrog_steps=2),\n trace_fn=trace_fn)\n\n def trace_log_accept_ratio(states, previous_kernel_results):\n return previous_kernel_results.log_accept_ratio\n\n def trace_everything(states, previous_kernel_results):\n return previous_kernel_results\n\n _, log_accept_ratio = sample_chain(trace_fn=trace_log_accept_ratio)\n _, kernel_results = sample_chain(trace_fn=trace_everything)\n\n acceptance_prob = tf.exp(tf.minimum(log_accept_ratio, 0.))\n # Equivalent to, but more efficient than:\n acceptance_prob = tf.exp(tf.minimum(kernel_results.log_accept_ratio, 0.))\n ```\n\n #### References\n\n [1]: Art B. Owen. 
Statistically efficient thinning of a Markov chain sampler.\n _Technical Report_, 2017.\n http://statweb.stanford.edu/~owen/reports/bestthinning.pdf\n \"\"\"\n if not kernel.is_calibrated:\n warnings.warn(\"supplied `TransitionKernel` is not calibrated. Markov \"\n \"chain may not converge to intended target distribution.\")\n with tf.compat.v1.name_scope(\n name, \"mcmc_sample_chain\",\n [num_results, num_burnin_steps, num_steps_between_results]):\n num_results = tf.convert_to_tensor(\n value=num_results, dtype=tf.int32, name=\"num_results\")\n num_burnin_steps = tf.convert_to_tensor(\n value=num_burnin_steps, dtype=tf.int32, name=\"num_burnin_steps\")\n num_steps_between_results = tf.convert_to_tensor(\n value=num_steps_between_results,\n dtype=tf.int32,\n name=\"num_steps_between_results\")\n current_state = tf.nest.map_structure(\n lambda x: tf.convert_to_tensor(value=x, name=\"current_state\"),\n current_state)\n if previous_kernel_results is None:\n previous_kernel_results = kernel.bootstrap_results(current_state)\n\n if trace_fn is None:\n # It simplifies the logic to use a dummy function here.\n trace_fn = lambda *args: ()\n no_trace = True\n else:\n no_trace = False\n if trace_fn is sample_chain.__defaults__[4]:\n warnings.warn(\"Tracing all kernel results by default is deprecated. 
Set \"\n \"the `trace_fn` argument to None (the future default \"\n \"value) or an explicit callback that traces the values \"\n \"you are interested in.\")\n\n def _trace_scan_fn(state_and_results, num_steps):\n next_state, current_kernel_results = mcmc_util.smart_for_loop(\n loop_num_iter=num_steps,\n body_fn=kernel.one_step,\n initial_loop_vars=list(state_and_results),\n parallel_iterations=parallel_iterations)\n return next_state, current_kernel_results\n\n (_, final_kernel_results), (all_states, trace) = mcmc_util.trace_scan(\n loop_fn=_trace_scan_fn,\n initial_state=(current_state, previous_kernel_results),\n elems=tf.one_hot(\n indices=0,\n depth=num_results,\n on_value=1 + num_burnin_steps,\n off_value=1 + num_steps_between_results,\n dtype=tf.int32),\n # pylint: disable=g-long-lambda\n trace_fn=lambda state_and_results: (state_and_results[0],\n trace_fn(*state_and_results)),\n # pylint: enable=g-long-lambda\n parallel_iterations=parallel_iterations)\n\n if return_final_kernel_results:\n return CheckpointableStatesAndTrace(\n all_states=all_states,\n trace=trace,\n final_kernel_results=final_kernel_results)\n else:\n if no_trace:\n return all_states\n else:\n return StatesAndTrace(all_states=all_states, trace=trace)", "response": "Implements Markov chain Monte Carlo via repeated TransitionKernel steps."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef trainable_gamma(shape, min_concentration=1e-3, min_scale=1e-5, name=None):\n with tf.compat.v1.variable_scope(None, default_name=\"trainable_gamma\"):\n unconstrained_concentration = tf.compat.v1.get_variable(\n \"unconstrained_concentration\",\n shape,\n initializer=tf.compat.v1.initializers.random_normal(\n mean=0.5, stddev=0.1))\n unconstrained_scale = tf.compat.v1.get_variable(\n \"unconstrained_scale\",\n shape,\n initializer=tf.compat.v1.initializers.random_normal(stddev=0.1))\n concentration = tf.maximum(tf.nn.softplus(unconstrained_concentration),\n 
min_concentration)\n rate = tf.maximum(1. / tf.nn.softplus(unconstrained_scale), 1. / min_scale)\n rv = ed.Gamma(concentration=concentration, rate=rate, name=name)\n return rv", "response": "Learnable Gamma via concentration and scale parameterization."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function for\nloading NIPS 2011 conference papers from CSV file.", "response": "def load_nips2011_papers(path):\n \"\"\"Loads NIPS 2011 conference papers.\n\n The NIPS 1987-2015 data set is in the form of a 11,463 x 5,812 matrix of\n per-paper word counts, containing 11,463 words and 5,811 NIPS conference\n papers (Perrone et al., 2016). We subset to papers in 2011 and words appearing\n in at least two documents and having a total word count of at least 10.\n\n Built from the Observations Python package.\n\n Args:\n path: str.\n Path to directory which either stores file or otherwise file will\n be downloaded and extracted there. Filename is `NIPS_1987-2015.csv`.\n\n Returns:\n bag_of_words: np.ndarray of shape [num_documents, num_words]. 
Each element\n denotes the number of occurrences of a specific word in a specific\n document.\n words: List of strings, denoting the words for `bag_of_words`'s columns.\n \"\"\"\n path = os.path.expanduser(path)\n filename = \"NIPS_1987-2015.csv\"\n filepath = os.path.join(path, filename)\n if not os.path.exists(filepath):\n url = (\"https://archive.ics.uci.edu/ml/machine-learning-databases/\"\n \"00371/NIPS_1987-2015.csv\")\n if not tf.io.gfile.exists(path):\n tf.io.gfile.makedirs(path)\n print(\"Downloading %s to %s\" % (url, filepath))\n urllib.request.urlretrieve(url, filepath)\n\n with open(filepath) as f:\n iterator = csv.reader(f)\n documents = next(iterator)[1:]\n words = []\n x_train = []\n for row in iterator:\n words.append(row[0])\n x_train.append(row[1:])\n\n # Use the builtin `int`; the deprecated `np.int` alias is removed in\n # recent NumPy releases.\n x_train = np.array(x_train, dtype=int)\n\n # Subset to documents in 2011 and words appearing in at least two documents\n # and having a total word count of at least 10.\n doc_idx = [i for i, document in enumerate(documents)\n if document.startswith(\"2011\")]\n documents = [documents[doc] for doc in doc_idx]\n x_train = x_train[:, doc_idx]\n word_idx = np.logical_and(np.sum(x_train != 0, 1) >= 2,\n np.sum(x_train, 1) >= 10)\n words = [word for word, idx in zip(words, word_idx) if idx]\n bag_of_words = x_train[word_idx, :].T\n return bag_of_words, words"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\nsharing init logic for amplitude and length_scale params.", "response": "def _init_params(self, amplitude, length_scale, validate_args):\n \"\"\"Shared init logic for `amplitude` and `length_scale` params.\n\n Args:\n amplitude: `Tensor` (or convertible) or `None` to convert, validate.\n length_scale: `Tensor` (or convertible) or `None` to convert, validate.\n validate_args: If `True`, parameters are checked for validity despite\n possibly degrading runtime performance.\n\n Returns:\n dtype: The common `DType` of the parameters.\n \"\"\"\n dtype = 
util.maybe_get_common_dtype(\n [amplitude, length_scale])\n if amplitude is not None:\n amplitude = tf.convert_to_tensor(\n value=amplitude, name='amplitude', dtype=dtype)\n self._amplitude = _validate_arg_if_not_none(\n amplitude, tf.compat.v1.assert_positive, validate_args)\n if length_scale is not None:\n length_scale = tf.convert_to_tensor(\n value=length_scale, name='length_scale', dtype=dtype)\n self._length_scale = _validate_arg_if_not_none(\n length_scale, tf.compat.v1.assert_positive, validate_args)\n return dtype"} {"SOURCE": "codesearchnet", "instruction": "Can you create a Python 3 function that\ngets the KL function registered for classes a and b.", "response": "def _registered_kl(type_a, type_b):\n \"\"\"Get the KL function registered for classes a and b.\"\"\"\n hierarchy_a = tf_inspect.getmro(type_a)\n hierarchy_b = tf_inspect.getmro(type_b)\n dist_to_children = None\n kl_fn = None\n for mro_to_a, parent_a in enumerate(hierarchy_a):\n for mro_to_b, parent_b in enumerate(hierarchy_b):\n candidate_dist = mro_to_a + mro_to_b\n candidate_kl_fn = _DIVERGENCES.get((parent_a, parent_b), None)\n if not kl_fn or (candidate_kl_fn and candidate_dist < dist_to_children):\n dist_to_children = candidate_dist\n kl_fn = candidate_kl_fn\n return kl_fn"} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function for\nreturning the batchwise KL - divergence between two distributions.", "response": "def kl_divergence(distribution_a, distribution_b,\n allow_nan_stats=True, name=None):\n \"\"\"Get the KL-divergence KL(distribution_a || distribution_b).\n\n If there is no KL method registered specifically for `type(distribution_a)`\n and `type(distribution_b)`, then the class hierarchies of these types are\n searched.\n\n If one KL method is registered between any pairs of classes in these two\n parent hierarchies, it is used.\n\n If more than one such registered method exists, the method whose registered\n classes have the shortest sum MRO paths to the input 
types is used.\n\n If more than one such shortest path exists, the first method\n identified in the search is used (favoring a shorter MRO distance to\n `type(distribution_a)`).\n\n Args:\n distribution_a: The first distribution.\n distribution_b: The second distribution.\n allow_nan_stats: Python `bool`, default `True`. When `True`,\n statistics (e.g., mean, mode, variance) use the value \"`NaN`\" to\n indicate the result is undefined. When `False`, an exception is raised\n if one or more of the statistic's batch members are undefined.\n name: Python `str` name prefixed to Ops created by this class.\n\n Returns:\n A Tensor with the batchwise KL-divergence between `distribution_a`\n and `distribution_b`.\n\n Raises:\n NotImplementedError: If no KL method is defined for distribution types\n of `distribution_a` and `distribution_b`.\n \"\"\"\n kl_fn = _registered_kl(type(distribution_a), type(distribution_b))\n if kl_fn is None:\n raise NotImplementedError(\n \"No KL(distribution_a || distribution_b) registered for distribution_a \"\n \"type {} and distribution_b type {}\".format(\n type(distribution_a).__name__, type(distribution_b).__name__))\n\n with tf.name_scope(\"KullbackLeibler\"):\n kl_t = kl_fn(distribution_a, distribution_b, name=name)\n if allow_nan_stats:\n return kl_t\n\n # Check KL for NaNs\n kl_t = tf.identity(kl_t, name=\"kl\")\n\n with tf.control_dependencies([\n tf.Assert(\n tf.logical_not(tf.reduce_any(input_tensor=tf.math.is_nan(kl_t))),\n [\n (\"KL calculation between {} and {} returned NaN values \"\n \"(and was called with allow_nan_stats=False). 
Values:\".format(\n distribution_a.name, distribution_b.name)),\n kl_t\n ])\n ]):\n return tf.identity(kl_t, name=\"checked_kl\")"} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\ncomputes the cross entropy of two distributions P and Q.", "response": "def cross_entropy(ref, other,\n allow_nan_stats=True, name=None):\n \"\"\"Computes the (Shannon) cross entropy.\n\n Denote two distributions by `P` (`ref`) and `Q` (`other`). Assuming `P, Q`\n are absolutely continuous with respect to one another and permit densities\n `p(x) dr(x)` and `q(x) dr(x)`, (Shannon) cross entropy is defined as:\n\n ```none\n H[P, Q] = E_p[-log q(X)] = -int_F p(x) log q(x) dr(x)\n ```\n\n where `F` denotes the support of the random variable `X ~ P`.\n\n Args:\n ref: `tfd.Distribution` instance.\n other: `tfd.Distribution` instance.\n allow_nan_stats: Python `bool`, default `True`. When `True`,\n statistics (e.g., mean, mode, variance) use the value \"`NaN`\" to\n indicate the result is undefined. 
When `False`, an exception is raised\n if one or more of the statistic's batch members are undefined.\n name: Python `str` prepended to names of ops created by this function.\n\n Returns:\n cross_entropy: `ref.dtype` `Tensor` with shape `[B1, ..., Bn]`\n representing `n` different calculations of (Shannon) cross entropy.\n \"\"\"\n with tf.name_scope(name or \"cross_entropy\"):\n return ref.entropy() + kl_divergence(\n ref, other, allow_nan_stats=allow_nan_stats)"} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef read_image(filepath):\n im_bytes = tf.io.read_file(filepath)\n im = tf.image.decode_image(im_bytes, channels=CHANNELS)\n im = tf.image.convert_image_dtype(im, tf.float32)\n return im", "response": "Reads an image tensor from a file."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef download_sprites():\n filepath = os.path.join(FLAGS.data_dir, DATA_SPRITES_DIR)\n if not tf.io.gfile.exists(filepath):\n if not tf.io.gfile.exists(FLAGS.data_dir):\n tf.io.gfile.makedirs(FLAGS.data_dir)\n zip_name = \"{}.zip\".format(filepath)\n urllib.request.urlretrieve(DATA_SPRITES_URL, zip_name)\n with zipfile.ZipFile(zip_name, \"r\") as zip_file:\n zip_file.extractall(FLAGS.data_dir)\n tf.io.gfile.remove(zip_name)\n return filepath", "response": "Downloads the sprites data and returns the saved filepath."} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\ncreates a character sprite from a set of attribute sprites.", "response": "def create_character(skin, hair, top, pants):\n \"\"\"Creates a character sprite from a set of attribute sprites.\"\"\"\n dtype = skin.dtype\n hair_mask = tf.cast(hair[..., -1:] <= 0, dtype)\n top_mask = tf.cast(top[..., -1:] <= 0, dtype)\n pants_mask = tf.cast(pants[..., -1:] <= 0, dtype)\n char = (skin * hair_mask) + hair\n char = (char * top_mask) + top\n char = (char * pants_mask) + pants\n 
return char"} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\ncreate a sequence tensor for the given character sprite.", "response": "def create_seq(character, action_metadata, direction, length=8, start=0):\n \"\"\"Creates a sequence.\n\n Args:\n character: A character sprite tensor.\n action_metadata: An action metadata tuple.\n direction: An integer representing the direction, i.e., the row\n offset within each action group corresponding to a particular\n direction.\n length: Desired length of the sequence. If this is longer than\n the number of available frames, it will roll over to the\n beginning.\n start: Index of possible frames at which to start the sequence.\n\n Returns:\n A sequence tensor.\n \"\"\"\n sprite_start = (action_metadata[0]+direction) * FRAME_SIZE\n sprite_end = (action_metadata[0]+direction+1) * FRAME_SIZE\n sprite_line = character[sprite_start:sprite_end, ...]\n\n # Extract 64x64 patches that are side-by-side in the sprite, and limit\n # to the actual number of frames for the given action.\n frames = tf.stack(tf.split(sprite_line, 13, axis=1)) # 13 is a hack\n frames = frames[0:action_metadata[1]]\n\n # Extract a slice of the desired length.\n # NOTE: Length could be longer than the number of frames, so tile as needed.\n frames = tf.roll(frames, shift=-start, axis=0)\n frames = tf.tile(frames, [2, 1, 1, 1]) # 2 is a hack\n frames = frames[:length]\n frames = tf.cast(frames, dtype=tf.float32)\n frames.set_shape([length, FRAME_SIZE, FRAME_SIZE, CHANNELS])\n return frames"} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\ncreate a random sequence.", "response": "def create_random_seq(character, action_metadata, direction, length=8):\n \"\"\"Creates a random sequence.\"\"\"\n start = tf.random.uniform([], maxval=action_metadata[1], dtype=tf.int32)\n return create_seq(character, action_metadata, direction, length, start)"} {"SOURCE": "codesearchnet", "instruction": "Implement a 
Python 3 function for\ncreating a tf.data.Dataset for the sprites dataset.", "response": "def create_sprites_dataset(characters, actions, directions, channels=3,\n length=8, shuffle=False, fake_data=False):\n \"\"\"Creates a tf.data pipeline for the sprites dataset.\n\n Args:\n characters: A list of (skin, hair, top, pants) tuples containing\n relative paths to the sprite png image for each attribute.\n actions: A list of Actions.\n directions: A list of Directions.\n channels: Number of image channels to yield.\n length: Desired length of the sequences.\n shuffle: Whether or not to shuffle the characters and sequences\n start frame.\n fake_data: Boolean for whether or not to yield synthetic data.\n\n Returns:\n A tf.data.Dataset yielding (seq, skin label index, hair label index,\n top label index, pants label index, action label index, skin label\n name, hair label_name, top label name, pants label name, action\n label name) tuples.\n \"\"\"\n if fake_data:\n dummy_image = tf.random.normal([HEIGHT, WIDTH, CHANNELS])\n else:\n basedir = download_sprites()\n\n action_names = [action.name for action in actions]\n action_metadata = [(action.start_row, action.frames) for action in actions]\n\n direction_rows = [direction.row_offset for direction in directions]\n\n chars = tf.data.Dataset.from_tensor_slices(characters)\n act_names = tf.data.Dataset.from_tensor_slices(action_names).repeat()\n acts_metadata = tf.data.Dataset.from_tensor_slices(action_metadata).repeat()\n dir_rows = tf.data.Dataset.from_tensor_slices(direction_rows).repeat()\n\n if shuffle:\n chars = chars.shuffle(len(characters))\n\n dataset = tf.data.Dataset.zip((chars, act_names, acts_metadata, dir_rows))\n\n skin_table = tf.contrib.lookup.index_table_from_tensor(sorted(SKIN_COLORS))\n hair_table = tf.contrib.lookup.index_table_from_tensor(sorted(HAIRSTYLES))\n top_table = tf.contrib.lookup.index_table_from_tensor(sorted(TOPS))\n pants_table = tf.contrib.lookup.index_table_from_tensor(sorted(PANTS))\n 
action_table = tf.contrib.lookup.index_table_from_tensor(sorted(action_names))\n\n def process_example(attrs, act_name, act_metadata, dir_row_offset):\n \"\"\"Processes a dataset row.\"\"\"\n skin_name = attrs[0]\n hair_name = attrs[1]\n top_name = attrs[2]\n pants_name = attrs[3]\n\n if fake_data:\n char = dummy_image\n else:\n skin = read_image(basedir + os.sep + skin_name)\n hair = read_image(basedir + os.sep + hair_name)\n top = read_image(basedir + os.sep + top_name)\n pants = read_image(basedir + os.sep + pants_name)\n char = create_character(skin, hair, top, pants)\n\n if shuffle:\n seq = create_random_seq(char, act_metadata, dir_row_offset, length)\n else:\n seq = create_seq(char, act_metadata, dir_row_offset, length)\n seq = seq[..., :channels] # limit output channels\n\n skin_idx = skin_table.lookup(skin_name)\n hair_idx = hair_table.lookup(hair_name)\n top_idx = top_table.lookup(top_name)\n pants_idx = pants_table.lookup(pants_name)\n act_idx = action_table.lookup(act_name)\n\n return (seq, skin_idx, hair_idx, top_idx, pants_idx, act_idx,\n skin_name, hair_name, top_name, pants_name, act_name)\n\n dataset = dataset.map(process_example)\n return dataset"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _maybe_validate_distributions(distributions, dtype_override, validate_args):\n assertions = []\n\n if not _is_iterable(distributions) or not distributions:\n raise ValueError('`distributions` must be a list of one or more '\n 'distributions.')\n\n if dtype_override is None:\n dts = [\n dtype_util.base_dtype(d.dtype)\n for d in distributions\n if d.dtype is not None\n ]\n if dts[1:] != dts[:-1]:\n raise TypeError('Distributions must have same dtype; found: {}.'.format(\n set(dtype_util.name(dt) for dt in dts)))\n\n # Validate event_ndims.\n for d in distributions:\n if tensorshape_util.rank(d.event_shape) is not None:\n if tensorshape_util.rank(d.event_shape) != 1:\n raise ValueError('`Distribution` 
must be vector variate, '\n 'found event ndims: {}.'.format(\n tensorshape_util.rank(d.event_shape)))\n elif validate_args:\n assertions.append(\n assert_util.assert_equal(\n 1, tf.size(input=d.event_shape_tensor()),\n message='`Distribution` must be vector variate.'))\n\n batch_shapes = [d.batch_shape for d in distributions]\n if all(tensorshape_util.is_fully_defined(b) for b in batch_shapes):\n if batch_shapes[1:] != batch_shapes[:-1]:\n raise ValueError('Distributions must have the same `batch_shape`; '\n 'found: {}.'.format(batch_shapes))\n elif validate_args:\n batch_shapes = [\n tensorshape_util.as_list(d.batch_shape) # pylint: disable=g-complex-comprehension\n if tensorshape_util.is_fully_defined(d.batch_shape) else\n d.batch_shape_tensor() for d in distributions\n ]\n assertions.extend(\n assert_util.assert_equal( # pylint: disable=g-complex-comprehension\n b1, b2,\n message='Distribution `batch_shape`s must be identical.')\n for b1, b2 in zip(batch_shapes[1:], batch_shapes[:-1]))\n\n return assertions", "response": "Checks that the given distributions satisfy all assumptions."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef _kl_blockwise_blockwise(b0, b1, name=None):\n if len(b0.distributions) != len(b1.distributions):\n raise ValueError(\n 'Can only compute KL divergence between Blockwise distributions with '\n 'the same number of component distributions.')\n\n # We also need to check that the event shapes match for each one.\n b0_event_sizes = [_event_size(d) for d in b0.distributions]\n b1_event_sizes = [_event_size(d) for d in b1.distributions]\n\n assertions = []\n message = ('Can only compute KL divergence between Blockwise distributions '\n 'with the same pairwise event shapes.')\n\n if (all(isinstance(event_size, int) for event_size in b0_event_sizes) and\n all(isinstance(event_size, int) for event_size in b1_event_sizes)):\n if b0_event_sizes != b1_event_sizes:\n raise 
ValueError(message)\n else:\n if b0.validate_args or b1.validate_args:\n assertions.extend(\n assert_util.assert_equal( # pylint: disable=g-complex-comprehension\n e1, e2, message=message)\n for e1, e2 in zip(b0_event_sizes, b1_event_sizes))\n\n with tf.name_scope(name or 'kl_blockwise_blockwise'):\n with tf.control_dependencies(assertions):\n return sum([\n kullback_leibler.kl_divergence(d1, d2) for d1, d2 in zip(\n b0.distributions, b1.distributions)])", "response": "Calculate the batched KL divergence between two Blockwise distributions."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _kl_half_normal_half_normal(a, b, name=None):\n with tf.name_scope(name or \"kl_half_normal_half_normal\"):\n # Consistent with\n # http://www.mast.queensu.ca/~communications/Papers/gil-msc11.pdf, page 119\n return (tf.math.log(b.scale) - tf.math.log(a.scale) +\n (a.scale**2 - b.scale**2) / (2 * b.scale**2))", "response": "Calculate the batched KL divergence KL(a || b) between two HalfNormal distributions."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\nflatten a list of kernels which may contain _SumKernel instances.", "response": "def _flatten_summand_list(kernels):\n \"\"\"Flatten a list of kernels which may contain _SumKernel instances.\n\n Args:\n kernels: Python list of `PositiveSemidefiniteKernel` instances\n\n Returns:\n Python list containing the elements of kernels, with any _SumKernel\n instances replaced by their `kernels` property contents.\n \"\"\"\n flattened = []\n for k in kernels:\n if isinstance(k, _SumKernel):\n flattened += k.kernels\n else:\n flattened.append(k)\n return flattened"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _flatten_multiplicand_list(kernels):\n flattened = []\n for k in kernels:\n if isinstance(k, _ProductKernel):\n flattened += k.kernels\n else:\n flattened.append(k)\n return flattened", 
"response": "Flatten a list of kernels which may contain _ProductKernel instances."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef build_input_pipeline(x_train, x_test, y_train, y_test,\n batch_size, valid_size):\n \"\"\"Build an Iterator switching between train and heldout data.\"\"\"\n\n x_train = x_train.astype(\"float32\")\n x_test = x_test.astype(\"float32\")\n\n x_train /= 255\n x_test /= 255\n\n y_train = y_train.flatten()\n y_test = y_test.flatten()\n\n if FLAGS.subtract_pixel_mean:\n x_train_mean = np.mean(x_train, axis=0)\n x_train -= x_train_mean\n x_test -= x_train_mean\n\n print(\"x_train shape:\" + str(x_train.shape))\n print(str(x_train.shape[0]) + \" train samples\")\n print(str(x_test.shape[0]) + \" test samples\")\n\n # Build an iterator over training batches.\n training_dataset = tf.data.Dataset.from_tensor_slices(\n (x_train, np.int32(y_train)))\n training_batches = training_dataset.shuffle(\n 50000, reshuffle_each_iteration=True).repeat().batch(batch_size)\n training_iterator = tf.compat.v1.data.make_one_shot_iterator(training_batches)\n\n # Build a iterator over the heldout set with batch_size=heldout_size,\n # i.e., return the entire heldout set as a constant.\n heldout_dataset = tf.data.Dataset.from_tensor_slices(\n (x_test, np.int32(y_test)))\n heldout_batches = heldout_dataset.repeat().batch(valid_size)\n heldout_iterator = tf.compat.v1.data.make_one_shot_iterator(heldout_batches)\n\n # Combine these into a feedable iterator that can switch between training\n # and validation inputs.\n handle = tf.compat.v1.placeholder(tf.string, shape=[])\n feedable_iterator = tf.compat.v1.data.Iterator.from_string_handle(\n handle, training_batches.output_types, training_batches.output_shapes)\n images, labels = feedable_iterator.get_next()\n\n return images, labels, handle, training_iterator, heldout_iterator", "response": "Builds a pipeline that can be used to switch between train and 
heldout data."} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef build_fake_data():\n num_examples = 10\n x_train = np.random.rand(num_examples, *IMAGE_SHAPE).astype(np.float32)\n y_train = np.random.permutation(np.arange(num_examples)).astype(np.int32)\n x_test = np.random.rand(num_examples, *IMAGE_SHAPE).astype(np.float32)\n y_test = np.random.permutation(np.arange(num_examples)).astype(np.int32)\n return (x_train, y_train), (x_test, y_test)", "response": "Build fake CIFAR10-style data for unit testing."} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\ncount the number of occurrences of each value in an integer array.", "response": "def count_integers(arr,\n weights=None,\n minlength=None,\n maxlength=None,\n axis=None,\n dtype=tf.int32,\n name=None):\n \"\"\"Counts the number of occurrences of each value in an integer array `arr`.\n\n Works like `tf.math.bincount`, but provides an `axis` kwarg that specifies\n dimensions to reduce over. With\n `~axis = [i for i in range(arr.ndim) if i not in axis]`,\n this function returns a `Tensor` of shape `[K] + arr.shape[~axis]`.\n\n If `minlength` and `maxlength` are not given, `K = tf.reduce_max(arr) + 1`\n if `arr` is non-empty, and 0 otherwise.\n If `weights` are non-None, then index `i` of the output stores the sum of the\n value in `weights` at each index where the corresponding value in `arr` is\n `i`.\n\n Args:\n arr: An `int32` `Tensor` of non-negative values.\n weights: If non-None, must be the same shape as arr. 
For each value in\n `arr`, the bin will be incremented by the corresponding weight instead of\n 1.\n minlength: If given, ensures the output has length at least `minlength`,\n padding with zeros at the end if necessary.\n maxlength: If given, skips values in `arr` that are equal or greater than\n `maxlength`, ensuring that the output has length at most `maxlength`.\n axis: A `0-D` or `1-D` `int32` `Tensor` (with static values) designating\n dimensions in `arr` to reduce over.\n `Default value:` `None`, meaning reduce over all dimensions.\n dtype: If `weights` is None, determines the type of the output bins.\n name: A name scope for the associated operations (optional).\n\n Returns:\n A vector with the same dtype as `weights` or the given `dtype`. The bin\n values.\n \"\"\"\n with tf.compat.v1.name_scope(\n name, 'count_integers', values=[arr, weights, minlength, maxlength,\n axis]):\n if axis is None:\n return tf.math.bincount(\n arr,\n weights=weights,\n minlength=minlength,\n maxlength=maxlength,\n dtype=dtype)\n\n arr = tf.convert_to_tensor(value=arr, dtype=tf.int32, name='arr')\n arr_ndims = _get_static_ndims(arr, expect_static=True)\n\n axis = _make_static_axis_non_negative_list(axis, arr_ndims)\n\n # ~axis from docstring. 
Dims in arr that are not in axis.\n not_axis = sorted(set(range(arr_ndims)).difference(axis))\n\n # If we're reducing over everything, just use standard bincount.\n if not not_axis:\n return tf.math.bincount(\n arr,\n weights=weights,\n minlength=minlength,\n maxlength=maxlength,\n dtype=dtype)\n\n # Move dims in ~axis to the left, so we can tf.map_fn bincount over them,\n # Producing counts for every index I in ~axis.\n # Thus, flat_arr is not totally flat, it just has the dims in ~axis\n # flattened.\n flat_arr = _move_dims_to_flat_end(arr, not_axis, arr_ndims, right_end=False)\n\n # tf.map_fn over dim 0.\n if weights is None:\n\n def one_bincount(arr_slice):\n return tf.math.bincount(\n arr_slice,\n weights=None,\n minlength=minlength,\n maxlength=maxlength,\n dtype=dtype)\n\n flat_counts = tf.map_fn(one_bincount, elems=flat_arr, dtype=dtype)\n else:\n weights = tf.convert_to_tensor(value=weights, name='weights')\n _get_static_ndims(weights, expect_static=True, expect_ndims=arr_ndims)\n flat_weights = _move_dims_to_flat_end(\n weights, not_axis, arr_ndims, right_end=False)\n\n def one_bincount(arr_and_weights_slices):\n arr_slice, weights_slice = arr_and_weights_slices\n return tf.math.bincount(\n arr_slice,\n weights=weights_slice,\n minlength=minlength,\n maxlength=maxlength,\n dtype=dtype)\n\n flat_counts = tf.map_fn(\n one_bincount, elems=[flat_arr, flat_weights], dtype=weights.dtype)\n\n # flat_counts.shape = [prod(~axis), K], because map_fn stacked on axis 0.\n # bincount needs to have the K bins in axis 0, so transpose...\n flat_counts_t = tf.transpose(a=flat_counts, perm=[1, 0])\n\n # Throw in this assert, to ensure shape assumptions are correct.\n _get_static_ndims(flat_counts_t, expect_ndims=2, expect_static=True)\n\n # not_axis_shape = arr.shape[~axis]\n not_axis_shape = tf.gather(tf.shape(input=arr), indices=not_axis)\n\n # The first index of flat_counts_t indexes bins 0,..,K-1, the rest are ~axis\n out_shape = tf.concat([[-1], not_axis_shape], 
axis=0)\n\n return tf.reshape(flat_counts_t, out_shape)"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\nreturn a list of bin values into discrete intervals.", "response": "def find_bins(x,\n edges,\n extend_lower_interval=False,\n extend_upper_interval=False,\n dtype=None,\n name=None):\n \"\"\"Bin values into discrete intervals.\n\n Given `edges = [c0, ..., cK]`, defining intervals\n `I0 = [c0, c1)`, `I1 = [c1, c2)`, ..., `I_{K-1} = [c_{K-1}, cK]`,\n This function returns `bins`, such that:\n `edges[bins[i]] <= x[i] < edges[bins[i] + 1]`.\n\n Args:\n x: Numeric `N-D` `Tensor` with `N > 0`.\n edges: `Tensor` of same `dtype` as `x`. The first dimension indexes edges\n of intervals. Must either be `1-D` or have\n `x.shape[1:] == edges.shape[1:]`. If `rank(edges) > 1`, `edges[k]`\n designates a shape `edges.shape[1:]` `Tensor` of bin edges for the\n corresponding dimensions of `x`.\n extend_lower_interval: Python `bool`. If `True`, extend the lowest\n interval `I0` to `(-inf, c1]`.\n extend_upper_interval: Python `bool`. If `True`, extend the upper\n interval `I_{K-1}` to `[c_{K-1}, +inf)`.\n dtype: The output type (`int32` or `int64`). `Default value:` `x.dtype`.\n This effects the output values when `x` is below/above the intervals,\n which will be `-1/K+1` for `int` types and `NaN` for `float`s.\n At indices where `x` is `NaN`, the output values will be `0` for `int`\n types and `NaN` for floats.\n name: A Python string name to prepend to created ops. Default: 'find_bins'\n\n Returns:\n bins: `Tensor` with same `shape` as `x` and `dtype`.\n Has whole number values. 
`bins[i] = k` means the `x[i]` falls into the\n `kth` bin, ie, `edges[bins[i]] <= x[i] < edges[bins[i] + 1]`.\n\n Raises:\n ValueError: If `edges.shape[0]` is determined to be less than 2.\n\n #### Examples\n\n Cut a `1-D` array\n\n ```python\n x = [0., 5., 6., 10., 20.]\n edges = [0., 5., 10.]\n tfp.stats.find_bins(x, edges)\n ==> [0., 0., 1., 1., np.nan]\n ```\n\n Cut `x` into its deciles\n\n ```python\n x = tf.random_uniform(shape=(100, 200))\n decile_edges = tfp.stats.quantiles(x, num_quantiles=10)\n bins = tfp.stats.find_bins(x, edges=decile_edges)\n bins.shape\n ==> (100, 200)\n tf.reduce_mean(bins == 0.)\n ==> approximately 0.1\n tf.reduce_mean(bins == 1.)\n ==> approximately 0.1\n ```\n\n \"\"\"\n # TFP users may be surprised to see the \"action\" in the leftmost dim of\n # edges, rather than the rightmost (event) dim. Why?\n # 1. Most likely you created edges by getting quantiles over samples, and\n # quantile/percentile return these edges in the leftmost (sample) dim.\n # 2. Say you have event_shape = [5], then we expect the bin will be different\n # for all 5 events, so the index of the bin should not be in the event dim.\n with tf.compat.v1.name_scope(\n name, default_name='find_bins', values=[x, edges]):\n in_type = dtype_util.common_dtype([x, edges],\n preferred_dtype=tf.float32)\n edges = tf.convert_to_tensor(value=edges, name='edges', dtype=in_type)\n x = tf.convert_to_tensor(value=x, name='x', dtype=in_type)\n\n if (tf.compat.dimension_value(edges.shape[0]) is not None and\n tf.compat.dimension_value(edges.shape[0]) < 2):\n raise ValueError(\n 'First dimension of `edges` must have length > 1 to index 1 or '\n 'more bin. 
Found: {}'.format(edges.shape))\n\n flattening_x = edges.shape.ndims == 1 and x.shape.ndims > 1\n\n if flattening_x:\n x_orig_shape = tf.shape(input=x)\n x = tf.reshape(x, [-1])\n\n if dtype is None:\n dtype = in_type\n dtype = tf.as_dtype(dtype)\n\n # Move first dims into the rightmost.\n x_permed = distribution_util.rotate_transpose(x, shift=-1)\n edges_permed = distribution_util.rotate_transpose(edges, shift=-1)\n\n # If...\n # x_permed = [0, 1, 6., 10]\n # edges = [0, 5, 10.]\n # ==> almost_output = [0, 1, 2, 2]\n searchsorted_type = dtype if dtype in [tf.int32, tf.int64] else None\n almost_output_permed = tf.searchsorted(\n sorted_sequence=edges_permed,\n values=x_permed,\n side='right',\n out_type=searchsorted_type)\n # Move the rightmost dims back to the leftmost.\n almost_output = tf.cast(\n distribution_util.rotate_transpose(almost_output_permed, shift=1),\n dtype)\n\n # In above example, we want [0, 0, 1, 1], so correct this here.\n bins = tf.clip_by_value(almost_output - 1, tf.cast(0, dtype),\n tf.cast(tf.shape(input=edges)[0] - 2, dtype))\n\n if not extend_lower_interval:\n low_fill = np.nan if dtype.is_floating else -1\n bins = tf.where(x < tf.expand_dims(edges[0], 0),\n tf.fill(tf.shape(input=x), tf.cast(low_fill, dtype)),\n bins)\n\n if not extend_upper_interval:\n up_fill = np.nan if dtype.is_floating else tf.shape(input=edges)[0] - 1\n bins = tf.where(x > tf.expand_dims(edges[-1], 0),\n tf.fill(tf.shape(input=x), tf.cast(up_fill, dtype)), bins)\n\n if flattening_x:\n bins = tf.reshape(bins, x_orig_shape)\n\n return bins"} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\nreturns a histogram of the number of events in a given axis.", "response": "def histogram(x,\n edges,\n axis=None,\n extend_lower_interval=False,\n extend_upper_interval=False,\n dtype=None,\n name=None):\n \"\"\"Count how often `x` falls in intervals defined by `edges`.\n\n Given `edges = [c0, ..., cK]`, defining intervals\n `I0 = [c0, 
c1)`, `I1 = [c1, c2)`, ..., `I_{K-1} = [c_{K-1}, cK]`,\n This function counts how often `x` falls into each interval.\n\n Values of `x` outside of the intervals cause errors. Consider using\n `extend_lower_interval`, `extend_upper_interval` to deal with this.\n\n Args:\n x: Numeric `N-D` `Tensor` with `N > 0`. If `axis` is not\n `None`, must have statically known number of dimensions. The\n `axis` kwarg determines which dimensions index iid samples.\n Other dimensions of `x` index \"events\" for which we will compute different\n histograms.\n edges: `Tensor` of same `dtype` as `x`. The first dimension indexes edges\n of intervals. Must either be `1-D` or have `edges.shape[1:]` the same\n as the dimensions of `x` excluding `axis`.\n If `rank(edges) > 1`, `edges[k]` designates a shape `edges.shape[1:]`\n `Tensor` of interval edges for the corresponding dimensions of `x`.\n axis: Optional `0-D` or `1-D` integer `Tensor` with constant\n values. The axis in `x` that index iid samples.\n `Default value:` `None` (treat every dimension as sample dimension).\n extend_lower_interval: Python `bool`. If `True`, extend the lowest\n interval `I0` to `(-inf, c1]`.\n extend_upper_interval: Python `bool`. If `True`, extend the upper\n interval `I_{K-1}` to `[c_{K-1}, +inf)`.\n dtype: The output type (`int32` or `int64`). 
`Default value:` `x.dtype`.\n name: A Python string name to prepend to created ops.\n `Default value:` 'histogram'\n\n Returns:\n counts: `Tensor` of type `dtype` and, with\n `~axis = [i for i in range(arr.ndim) if i not in axis]`,\n `counts.shape = [edges.shape[0]] + x.shape[~axis]`.\n With `I` a multi-index into `~axis`, `counts[k][I]` is the number of times\n event(s) fell into the `kth` interval of `edges`.\n\n #### Examples\n\n ```python\n # x.shape = [1000, 2]\n # x[:, 0] ~ Uniform(0, 1), x[:, 1] ~ Uniform(1, 2).\n x = tf.stack([tf.random_uniform([1000]), 1 + tf.random_uniform([1000])],\n axis=-1)\n\n # edges ==> bins [0, 0.5), [0.5, 1.0), [1.0, 1.5), [1.5, 2.0].\n edges = [0., 0.5, 1.0, 1.5, 2.0]\n\n tfp.stats.histogram(x, edges)\n ==> approximately [500, 500, 500, 500]\n\n tfp.stats.histogram(x, edges, axis=0)\n ==> approximately [[500, 500, 0, 0], [0, 0, 500, 500]]\n ```\n\n \"\"\"\n with tf.compat.v1.name_scope(name, 'histogram', values=[x, edges, axis]):\n\n # Tensor conversions.\n in_dtype = dtype_util.common_dtype([x, edges], preferred_dtype=tf.float32)\n\n x = tf.convert_to_tensor(value=x, name='x', dtype=in_dtype)\n edges = tf.convert_to_tensor(value=edges, name='edges', dtype=in_dtype)\n\n # Move dims in axis to the left end as one flattened dim.\n # After this, x.shape = [n_samples] + E.\n if axis is None:\n x = tf.reshape(x, shape=[-1])\n else:\n x_ndims = _get_static_ndims(\n x, expect_static=True, expect_ndims_at_least=1)\n axis = _make_static_axis_non_negative_list(axis, x_ndims)\n if not axis:\n raise ValueError('`axis` cannot be empty. 
Found: {}'.format(axis))\n x = _move_dims_to_flat_end(x, axis, x_ndims, right_end=False)\n\n # bins.shape = x.shape = [n_samples] + E,\n # and bins[i] is a shape E Tensor of the bins that sample `i` fell into.\n # E is the \"event shape\", which is [] if axis is None.\n bins = find_bins(\n x,\n edges=edges,\n # If not extending intervals, then values outside the edges will return\n # -1, which gives an error when fed to bincount.\n extend_lower_interval=extend_lower_interval,\n extend_upper_interval=extend_upper_interval,\n dtype=tf.int32)\n\n # TODO(b/124015136) Use standard tf.math.bincount once it supports `axis`.\n counts = count_integers(\n bins,\n # Ensure we get correct output, even if x did not fall into every bin\n minlength=tf.shape(input=edges)[0] - 1,\n maxlength=tf.shape(input=edges)[0] - 1,\n axis=0,\n dtype=dtype or in_dtype)\n n_edges = tf.compat.dimension_value(edges.shape[0])\n if n_edges is not None:\n counts.set_shape(\n tf.TensorShape([n_edges - 1]).concatenate(counts.shape[1:]))\n return counts"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef percentile(x,\n q,\n axis=None,\n interpolation=None,\n keep_dims=False,\n validate_args=False,\n preserve_gradients=True,\n name=None):\n \"\"\"Compute the `q`-th percentile(s) of `x`.\n\n Given a vector `x`, the `q`-th percentile of `x` is the value `q / 100` of the\n way from the minimum to the maximum in a sorted copy of `x`.\n\n The values and distances of the two nearest neighbors as well as the\n `interpolation` parameter will determine the percentile if the normalized\n ranking does not match the location of `q` exactly.\n\n This function is the same as the median if `q = 50`, the same as the minimum\n if `q = 0` and the same as the maximum if `q = 100`.\n\n Multiple percentiles can be computed at once by using `1-D` vector `q`.\n Dimension zero of the returned `Tensor` will index the different percentiles.\n\n Compare to `numpy.percentile`.\n\n 
Args:\n x: Numeric `N-D` `Tensor` with `N > 0`. If `axis` is not `None`,\n `x` must have statically known number of dimensions.\n q: Scalar or vector `Tensor` with values in `[0, 100]`. The percentile(s).\n axis: Optional `0-D` or `1-D` integer `Tensor` with constant values. The\n axis that index independent samples over which to return the desired\n percentile. If `None` (the default), treat every dimension as a sample\n dimension, returning a scalar.\n interpolation : {'nearest', 'linear', 'lower', 'higher', 'midpoint'}.\n Default value: 'nearest'. This specifies the interpolation method to\n use when the desired quantile lies between two data points `i < j`:\n * linear: i + (j - i) * fraction, where fraction is the fractional part\n of the index surrounded by i and j.\n * lower: `i`.\n * higher: `j`.\n * nearest: `i` or `j`, whichever is nearest.\n * midpoint: (i + j) / 2.\n `linear` and `midpoint` interpolation do not work with integer dtypes.\n keep_dims: Python `bool`. If `True`, the last dimension is kept with size 1\n If `False`, the last dimension is removed from the output shape.\n validate_args: Whether to add runtime checks of argument validity. If\n False, and arguments are incorrect, correct behavior is not guaranteed.\n preserve_gradients: Python `bool`. If `True`, ensure that gradient w.r.t\n the percentile `q` is preserved in the case of linear interpolation.\n If `False`, the gradient will be (incorrectly) zero when `q` corresponds\n to a point in `x`.\n name: A Python string name to give this `Op`. Default is 'percentile'\n\n Returns:\n A `(rank(q) + N - len(axis))` dimensional `Tensor` of same dtype as `x`, or,\n if `axis` is `None`, a `rank(q)` `Tensor`. 
The first `rank(q)` dimensions\n index quantiles for different values of `q`.\n\n Raises:\n ValueError: If argument 'interpolation' is not an allowed type.\n ValueError: If interpolation type not compatible with `dtype`.\n\n #### Examples\n\n ```python\n # Get 30th percentile with default ('nearest') interpolation.\n x = [1., 2., 3., 4.]\n tfp.stats.percentile(x, q=30.)\n ==> 2.0\n\n # Get 30th percentile with 'linear' interpolation.\n x = [1., 2., 3., 4.]\n tfp.stats.percentile(x, q=30., interpolation='linear')\n ==> 1.9\n\n # Get 30th and 70th percentiles with 'lower' interpolation\n x = [1., 2., 3., 4.]\n tfp.stats.percentile(x, q=[30., 70.], interpolation='lower')\n ==> [1., 3.]\n\n # Get 100th percentile (maximum). By default, this is computed over every dim\n x = [[1., 2.]\n [3., 4.]]\n tfp.stats.percentile(x, q=100.)\n ==> 4.\n\n # Treat the leading dim as indexing samples, and find the 100th quantile (max)\n # over all such samples.\n x = [[1., 2.]\n [3., 4.]]\n tfp.stats.percentile(x, q=100., axis=[0])\n ==> [3., 4.]\n ```\n\n \"\"\"\n name = name or 'percentile'\n allowed_interpolations = {'linear', 'lower', 'higher', 'nearest', 'midpoint'}\n\n if interpolation is None:\n interpolation = 'nearest'\n else:\n if interpolation not in allowed_interpolations:\n raise ValueError('Argument `interpolation` must be in %s. 
Found %s' %\n (allowed_interpolations, interpolation))\n\n with tf.compat.v1.name_scope(name, values=[x, q]):\n x = tf.convert_to_tensor(value=x, name='x')\n\n if interpolation in {'linear', 'midpoint'} and x.dtype.is_integer:\n raise TypeError('{} interpolation not allowed with dtype {}'.format(\n interpolation, x.dtype))\n\n # Double is needed here and below, else we get the wrong index if the array\n # is huge along axis.\n q = tf.cast(q, tf.float64)\n _get_static_ndims(q, expect_ndims_no_more_than=1)\n\n if validate_args:\n q = distribution_util.with_dependencies([\n tf.compat.v1.assert_rank_in(q, [0, 1]),\n tf.compat.v1.assert_greater_equal(q, tf.cast(0., tf.float64)),\n tf.compat.v1.assert_less_equal(q, tf.cast(100., tf.float64))\n ], q)\n\n # Move `axis` dims of `x` to the rightmost, call it `y`.\n if axis is None:\n y = tf.reshape(x, [-1])\n else:\n x_ndims = _get_static_ndims(\n x, expect_static=True, expect_ndims_at_least=1)\n axis = _make_static_axis_non_negative_list(axis, x_ndims)\n y = _move_dims_to_flat_end(x, axis, x_ndims, right_end=True)\n\n frac_at_q_or_above = 1. - q / 100.\n\n # Sort everything, not just the top 'k' entries, which allows multiple calls\n # to sort only once (under the hood) and use CSE.\n sorted_y = _sort_tensor(y)\n\n d = tf.cast(tf.shape(input=y)[-1], tf.float64)\n\n def _get_indices(interp_type):\n \"\"\"Get values of y at the indices implied by interp_type.\"\"\"\n # Note `lower` <--> ceiling. Confusing, huh? 
Due to the fact that\n # _sort_tensor sorts highest to lowest, tf.ceil corresponds to the higher\n # index, but the lower value of y!\n if interp_type == 'lower':\n indices = tf.math.ceil((d - 1) * frac_at_q_or_above)\n elif interp_type == 'higher':\n indices = tf.floor((d - 1) * frac_at_q_or_above)\n elif interp_type == 'nearest':\n indices = tf.round((d - 1) * frac_at_q_or_above)\n # d - 1 will be distinct from d in int32, but not necessarily double.\n # So clip to avoid out of bounds errors.\n return tf.clip_by_value(\n tf.cast(indices, tf.int32), 0,\n tf.shape(input=y)[-1] - 1)\n\n if interpolation in ['nearest', 'lower', 'higher']:\n gathered_y = tf.gather(sorted_y, _get_indices(interpolation), axis=-1)\n elif interpolation == 'midpoint':\n gathered_y = 0.5 * (\n tf.gather(sorted_y, _get_indices('lower'), axis=-1) +\n tf.gather(sorted_y, _get_indices('higher'), axis=-1))\n elif interpolation == 'linear':\n # Copy-paste of docstring on interpolation:\n # linear: i + (j - i) * fraction, where fraction is the fractional part\n # of the index surrounded by i and j.\n larger_y_idx = _get_indices('lower')\n exact_idx = (d - 1) * frac_at_q_or_above\n if preserve_gradients:\n # If q corresponds to a point in x, we will initially have\n # larger_y_idx == smaller_y_idx.\n # This results in the gradient w.r.t. 
fraction being zero (recall `q`\n # enters only through `fraction`...and see that things cancel).\n # The fix is to ensure that smaller_y_idx and larger_y_idx are always\n # separated by exactly 1.\n smaller_y_idx = tf.maximum(larger_y_idx - 1, 0)\n larger_y_idx = tf.minimum(smaller_y_idx + 1, tf.shape(input=y)[-1] - 1)\n fraction = tf.cast(larger_y_idx, tf.float64) - exact_idx\n else:\n smaller_y_idx = _get_indices('higher')\n fraction = tf.math.ceil((d - 1) * frac_at_q_or_above) - exact_idx\n\n fraction = tf.cast(fraction, y.dtype)\n gathered_y = (\n tf.gather(sorted_y, larger_y_idx, axis=-1) * (1 - fraction) +\n tf.gather(sorted_y, smaller_y_idx, axis=-1) * fraction)\n\n # Propagate NaNs\n if x.dtype in (tf.bfloat16, tf.float16, tf.float32, tf.float64):\n # Apparently tf.is_nan doesn't like other dtypes\n nan_batch_members = tf.reduce_any(\n input_tensor=tf.math.is_nan(x), axis=axis)\n right_rank_matched_shape = tf.pad(\n tensor=tf.shape(input=nan_batch_members),\n paddings=[[0, tf.rank(input=q)]],\n constant_values=1)\n nan_batch_members = tf.reshape(\n nan_batch_members, shape=right_rank_matched_shape)\n shape_gathered_y = tf.shape(input=gathered_y)\n nan = np.array(np.nan, gathered_y.dtype.as_numpy_dtype)\n gathered_y = tf.where(\n tf.broadcast_to(nan_batch_members, shape_gathered_y),\n tf.fill(shape_gathered_y, nan),\n gathered_y)\n\n # Expand dimensions if requested\n if keep_dims:\n if axis is None:\n ones_vec = tf.ones(\n shape=[_get_best_effort_ndims(x) + _get_best_effort_ndims(q)],\n dtype=tf.int32)\n gathered_y *= tf.ones(ones_vec, dtype=x.dtype)\n else:\n gathered_y = _insert_back_keep_dims(gathered_y, axis)\n\n # If q is a scalar, then result has the right shape.\n # If q is a vector, then result has trailing dim of shape q.shape, which\n # needs to be rotated to dim 0.\n return distribution_util.rotate_transpose(gathered_y, tf.rank(q))", "response": "Compute the percentile of x along a given axis."} {"SOURCE": "codesearchnet", "instruction": "Can 
you create a Python 3 function that\ncomputes the quantiles of a distribution along a specified axis.", "response": "def quantiles(x,\n num_quantiles,\n axis=None,\n interpolation=None,\n keep_dims=False,\n validate_args=False,\n name=None):\n \"\"\"Compute quantiles of `x` along `axis`.\n\n The quantiles of a distribution are cut points dividing the range into\n intervals with equal probabilities.\n\n Given a vector `x` of samples, this function estimates the cut points by\n returning `num_quantiles + 1` cut points, `(c0, ..., cn)`, such that, roughly\n speaking, equal number of sample points lie in the `num_quantiles` intervals\n `[c0, c1), [c1, c2), ..., [c_{n-1}, cn]`. That is,\n\n * About `1 / n` fraction of the data lies in `[c_{k-1}, c_k)`, `k = 1, ..., n`\n * About `k / n` fraction of the data lies below `c_k`.\n * `c0` is the sample minimum and `cn` is the maximum.\n\n The exact number of data points in each interval depends on the size of\n `x` (e.g. whether the size is divisible by `n`) and the `interpolation` kwarg.\n\n Args:\n x: Numeric `N-D` `Tensor` with `N > 0`. If `axis` is not `None`,\n `x` must have statically known number of dimensions.\n num_quantiles: Scalar `integer` `Tensor`. The number of intervals the\n returned `num_quantiles + 1` cut points divide the range into.\n axis: Optional `0-D` or `1-D` integer `Tensor` with constant values. The\n axis that index independent samples over which to return the desired\n percentile. If `None` (the default), treat every dimension as a sample\n dimension, returning a scalar.\n interpolation : {'nearest', 'linear', 'lower', 'higher', 'midpoint'}.\n Default value: 'nearest'. This specifies the interpolation method to\n use when the fractions `k / n` lie between two data points `i < j`:\n * linear: i + (j - i) * fraction, where fraction is the fractional part\n of the index surrounded by i and j.\n * lower: `i`.\n * higher: `j`.\n * nearest: `i` or `j`, whichever is nearest.\n * midpoint: (i + j) / 2. 
`linear` and `midpoint` interpolation do not\n work with integer dtypes.\n keep_dims: Python `bool`. If `True`, the last dimension is kept with size 1\n If `False`, the last dimension is removed from the output shape.\n validate_args: Whether to add runtime checks of argument validity. If\n False, and arguments are incorrect, correct behavior is not guaranteed.\n name: A Python string name to give this `Op`. Default is 'quantiles'\n\n Returns:\n cut_points: A `rank(x) + 1 - len(axis)` dimensional `Tensor` with same\n `dtype` as `x` and shape `[num_quantiles + 1, ...]` where the trailing shape\n is that of `x` without the dimensions in `axis` (unless `keep_dims is True`)\n\n Raises:\n ValueError: If argument 'interpolation' is not an allowed type.\n ValueError: If interpolation type not compatible with `dtype`.\n\n #### Examples\n\n ```python\n # Get quartiles of x with various interpolation choices.\n x = [0., 1., 2., 3., 4., 5., 6., 7., 8., 9., 10.]\n\n tfp.stats.quantiles(x, num_quantiles=4, interpolation='nearest')\n ==> [ 0., 2., 5., 8., 10.]\n\n tfp.stats.quantiles(x, num_quantiles=4, interpolation='linear')\n ==> [ 0. , 2.5, 5. , 7.5, 10. ]\n\n tfp.stats.quantiles(x, num_quantiles=4, interpolation='lower')\n ==> [ 0., 2., 5., 7., 10.]\n\n # Get deciles of columns of an R x C data set.\n data = load_my_columnar_data(...)\n tfp.stats.quantiles(data, num_quantiles=10)\n ==> Shape [11, C] Tensor\n ```\n\n \"\"\"\n with tf.compat.v1.name_scope(\n name, 'quantiles', values=[x, num_quantiles, axis]):\n x = tf.convert_to_tensor(value=x, name='x')\n return percentile(\n x,\n q=tf.linspace(\n # percentile casts q to float64 before using it...so may as well use\n # float64 here. 
Note that using x.dtype won't work with linspace\n # if x is integral type (which is another motivation for hard-coding\n # float64).\n tf.convert_to_tensor(value=0, dtype=tf.float64),\n tf.convert_to_tensor(value=100, dtype=tf.float64),\n num=num_quantiles + 1),\n axis=axis,\n interpolation=interpolation,\n keep_dims=keep_dims,\n validate_args=validate_args,\n preserve_gradients=False)"} {"SOURCE": "codesearchnet", "instruction": "Can you create a Python 3 function that\ngets the number of dimensions and assert that some expectations are met.", "response": "def _get_static_ndims(x,\n expect_static=False,\n expect_ndims=None,\n expect_ndims_no_more_than=None,\n expect_ndims_at_least=None):\n \"\"\"Get static number of dimensions and assert that some expectations are met.\n\n This function returns the number of dimensions 'ndims' of x, as a Python int.\n\n The optional expect arguments are used to check the ndims of x, but this is\n only done if the static ndims of x is not None.\n\n Args:\n x: A Tensor.\n expect_static: Expect `x` to have statically defined `ndims`.\n expect_ndims: Optional Python integer. If provided, assert that x has\n number of dimensions equal to this.\n expect_ndims_no_more_than: Optional Python integer. If provided, assert\n that x has no more than this many dimensions.\n expect_ndims_at_least: Optional Python integer. If provided, assert that x\n has at least this many dimensions.\n\n Returns:\n ndims: A Python integer.\n\n Raises:\n ValueError: If any of the expectations above are violated.\n \"\"\"\n ndims = x.shape.ndims\n if ndims is None:\n shape_const = tf.get_static_value(tf.shape(input=x))\n if shape_const is not None:\n ndims = shape_const.ndim\n\n if ndims is None:\n if expect_static:\n raise ValueError(\n 'Expected argument `x` to have statically defined `ndims`. Found: %s' %\n x)\n return\n\n if expect_ndims is not None:\n ndims_message = ('Expected argument `x` to have ndims %s. 
Found tensor %s'\n % (expect_ndims, x))\n if ndims != expect_ndims:\n raise ValueError(ndims_message)\n\n if expect_ndims_at_least is not None:\n ndims_at_least_message = (\n 'Expected argument `x` to have ndims >= %d. Found tensor %s' %\n (expect_ndims_at_least, x))\n if ndims < expect_ndims_at_least:\n raise ValueError(ndims_at_least_message)\n\n if expect_ndims_no_more_than is not None:\n ndims_no_more_than_message = (\n 'Expected argument `x` to have ndims <= %d. Found tensor %s' %\n (expect_ndims_no_more_than, x))\n if ndims > expect_ndims_no_more_than:\n raise ValueError(ndims_no_more_than_message)\n\n return ndims"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\nget ndims if possible.", "response": "def _get_best_effort_ndims(x,\n expect_ndims=None,\n expect_ndims_at_least=None,\n expect_ndims_no_more_than=None):\n \"\"\"Get static ndims if possible. Fallback on `tf.rank(x)`.\"\"\"\n ndims_static = _get_static_ndims(\n x,\n expect_ndims=expect_ndims,\n expect_ndims_at_least=expect_ndims_at_least,\n expect_ndims_no_more_than=expect_ndims_no_more_than)\n if ndims_static is not None:\n return ndims_static\n return tf.rank(x)"} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\ninsert the dims in axis back as singletons after being removed.", "response": "def _insert_back_keep_dims(x, axis):\n \"\"\"Insert the dims in `axis` back as singletons after being removed.\n\n Args:\n x: `Tensor`.\n axis: Python list of integers.\n\n Returns:\n `Tensor` with same values as `x`, but additional singleton dimensions.\n \"\"\"\n for i in sorted(axis):\n x = tf.expand_dims(x, axis=i)\n return x"} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\nconverts a possibly negatively indexed axis to a non-negative list of ints.", "response": "def _make_static_axis_non_negative_list(axis, ndims):\n \"\"\"Convert possibly negatively indexed axis to non-negative list of ints.\n\n Args:\n axis: 
Integer Tensor.\n ndims: Number of dimensions into which axis indexes.\n\n Returns:\n A list of non-negative Python integers.\n\n Raises:\n ValueError: If `axis` is not statically defined.\n \"\"\"\n axis = distribution_util.make_non_negative_axis(axis, ndims)\n\n axis_const = tf.get_static_value(axis)\n if axis_const is None:\n raise ValueError(\n 'Expected argument `axis` to be statically available. Found: %s' %\n axis)\n\n # Make at least 1-D.\n axis = axis_const + np.zeros([1], dtype=axis_const.dtype)\n\n return list(int(dim) for dim in axis)"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\nmove dims corresponding to axis in x to the end then flatten.", "response": "def _move_dims_to_flat_end(x, axis, x_ndims, right_end=True):\n \"\"\"Move dims corresponding to `axis` in `x` to the end, then flatten.\n\n Args:\n x: `Tensor` with shape `[B0,B1,...,Bb]`.\n axis: Python list of indices into dimensions of `x`.\n x_ndims: Python integer holding number of dimensions in `x`.\n right_end: Python bool. 
Whether to move dims to the right end (else left).\n\n Returns:\n `Tensor` with value from `x` and dims in `axis` moved to end into one single\n dimension.\n \"\"\"\n\n if not axis:\n return x\n\n # Suppose x.shape = [a, b, c, d]\n # Suppose axis = [1, 3]\n\n # other_dims = [0, 2] in example above.\n other_dims = sorted(set(range(x_ndims)).difference(axis))\n # x_permed.shape = [a, c, b, d]\n perm = other_dims + list(axis) if right_end else list(axis) + other_dims\n x_permed = tf.transpose(a=x, perm=perm)\n\n if x.shape.is_fully_defined():\n x_shape = x.shape.as_list()\n # other_shape = [a, c], end_shape = [b * d]\n other_shape = [x_shape[i] for i in other_dims]\n end_shape = [np.prod([x_shape[i] for i in axis])]\n full_shape = (\n other_shape + end_shape if right_end else end_shape + other_shape)\n else:\n other_shape = tf.gather(tf.shape(input=x), other_dims)\n full_shape = tf.concat(\n [other_shape, [-1]] if right_end else [[-1], other_shape], axis=0)\n return tf.reshape(x_permed, shape=full_shape)"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\nusing top_k to sort a Tensor along the last dimension.", "response": "def _sort_tensor(tensor):\n \"\"\"Use `top_k` to sort a `Tensor` along the last dimension.\"\"\"\n sorted_, _ = tf.nn.top_k(tensor, k=tf.shape(input=tensor)[-1])\n sorted_.set_shape(tensor.shape)\n return sorted_"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\nbuild a list of LinearGaussianStateSpaceModel objects for each component in the current state space.", "response": "def make_component_state_space_models(self,\n num_timesteps,\n param_vals,\n initial_step=0):\n \"\"\"Build an ordered list of Distribution instances for component models.\n\n Args:\n num_timesteps: Python `int` number of timesteps to model.\n param_vals: a list of `Tensor` parameter values in order corresponding to\n `self.parameters`, or a dict mapping from parameter names to values.\n initial_step: optional `int` 
specifying the initial timestep to model.\n This is relevant when the model contains time-varying components,\n e.g., holidays or seasonality.\n\n Returns:\n component_ssms: a Python list of `LinearGaussianStateSpaceModel`\n Distribution objects, in order corresponding to `self.components`.\n \"\"\"\n\n with tf.compat.v1.name_scope('make_component_state_space_models'):\n\n # List the model parameters in canonical order\n param_map = self._canonicalize_param_vals_as_map(param_vals)\n param_vals_list = [param_map[p.name] for p in self.parameters]\n\n # Build SSMs for each component model. We process the components in\n # canonical order, extracting the parameters for each component from the\n # (ordered) list of parameters.\n remaining_param_vals = param_vals_list[1:]\n component_ssms = []\n for component in self.components:\n num_parameters = len(component.parameters)\n component_param_vals = remaining_param_vals[:num_parameters]\n remaining_param_vals = remaining_param_vals[num_parameters:]\n\n component_ssms.append(\n component.make_state_space_model(\n num_timesteps,\n param_vals=component_param_vals,\n initial_step=initial_step))\n\n return component_ssms"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef kl_reverse(logu, self_normalized=False, name=None):\n\n with tf.compat.v1.name_scope(name, \"kl_reverse\", [logu]):\n return amari_alpha(logu, alpha=0., self_normalized=self_normalized)", "response": "A function that computes the reverse Kullback-Leibler Csiszar-function at logu."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef jensen_shannon(logu, self_normalized=False, name=None):\n\n with tf.compat.v1.name_scope(name, \"jensen_shannon\", [logu]):\n logu = tf.convert_to_tensor(value=logu, name=\"logu\")\n npdt = logu.dtype.as_numpy_dtype\n y = tf.nn.softplus(logu)\n if self_normalized:\n y -= np.log(2).astype(npdt)\n return tf.exp(logu) 
* logu - (1. + tf.exp(logu)) * y", "response": "A function that computes the Jensen-Shannon Csiszar-function in log-space."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef triangular(logu, name=None):\n\n with tf.compat.v1.name_scope(name, \"triangular\", [logu]):\n logu = tf.convert_to_tensor(value=logu, name=\"logu\")\n return pearson(logu) / (1. + tf.exp(logu))", "response": "A function that evaluates the Triangular Csiszar-function at logu."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef t_power(logu, t, self_normalized=False, name=None):\n with tf.compat.v1.name_scope(name, \"t_power\", [logu, t]):\n logu = tf.convert_to_tensor(value=logu, name=\"logu\")\n t = tf.convert_to_tensor(value=t, dtype=logu.dtype.base_dtype, name=\"t\")\n fu = tf.math.expm1(t * logu)\n if self_normalized:\n fu -= t * tf.math.expm1(logu)\n fu *= tf.where(tf.logical_and(0. 
< t, t < 1.),\n -tf.ones_like(t),\n tf.ones_like(t))\n return fu", "response": "A function that evaluates the T-Power Csiszar-function at logu with power t."} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef modified_gan(logu, self_normalized=False, name=None):\n\n with tf.compat.v1.name_scope(name, \"chi_square\", [logu]):\n logu = tf.convert_to_tensor(value=logu, name=\"logu\")\n y = tf.nn.softplus(logu) - logu\n if self_normalized:\n y += 0.5 * tf.math.expm1(logu)\n return y", "response": "The Modified-GAN Csiszar-function in log-space."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef dual_csiszar_function(logu, csiszar_function, name=None):\n\n with tf.compat.v1.name_scope(name, \"dual_csiszar_function\", [logu]):\n return tf.exp(logu) * csiszar_function(-logu)", "response": "Calculates the dual of a Csiszar-function in log-space."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef symmetrized_csiszar_function(logu, csiszar_function, name=None):\n\n with tf.compat.v1.name_scope(name, \"symmetrized_csiszar_function\", [logu]):\n logu = tf.convert_to_tensor(value=logu, name=\"logu\")\n return 0.5 * (csiszar_function(logu)\n + dual_csiszar_function(logu, csiszar_function))", "response": "This function symmetrizes a Csiszar-function in log-space."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef monte_carlo_csiszar_f_divergence(\n f,\n p_log_prob,\n q,\n num_draws,\n use_reparametrization=None,\n seed=None,\n name=None):\n \"\"\"Monte-Carlo approximation of the Csiszar f-Divergence.\n\n A Csiszar-function is a member of,\n\n ```none\n F = { f:R_+ to R : f convex }.\n ```\n\n The Csiszar f-Divergence for Csiszar-function f is given by:\n\n ```none\n D_f[p(X), q(X)] := E_{q(X)}[ f( p(X) / q(X) ) ]\n ~= 
m**-1 sum_j^m f( p(x_j) / q(x_j) ),\n where x_j ~iid q(X)\n ```\n\n Tricks: Reparameterization and Score-Gradient\n\n When q is \"reparameterized\", i.e., a diffeomorphic transformation of a\n parameterless distribution (e.g.,\n `Normal(Y; m, s) <=> Y = sX + m, X ~ Normal(0,1)`), we can swap gradient and\n expectation, i.e.,\n `grad[Avg{ s_i : i=1...n }] = Avg{ grad[s_i] : i=1...n }` where `S_n=Avg{s_i}`\n and `s_i = f(x_i), x_i ~iid q(X)`.\n\n However, if q is not reparameterized, TensorFlow's gradient will be incorrect\n since the chain-rule stops at samples of unreparameterized distributions. In\n this circumstance using the Score-Gradient trick results in an unbiased\n gradient, i.e.,\n\n ```none\n grad[ E_q[f(X)] ]\n = grad[ int dx q(x) f(x) ]\n = int dx grad[ q(x) f(x) ]\n = int dx [ q'(x) f(x) + q(x) f'(x) ]\n = int dx q(x) [q'(x) / q(x) f(x) + f'(x) ]\n = int dx q(x) grad[ f(x) q(x) / stop_grad[q(x)] ]\n = E_q[ grad[ f(x) q(x) / stop_grad[q(x)] ] ]\n ```\n\n Unless `q.reparameterization_type != tfd.FULLY_REPARAMETERIZED` it is\n usually preferable to set `use_reparametrization = True`.\n\n Example Application:\n\n The Csiszar f-Divergence is a useful framework for variational inference.\n I.e., observe that,\n\n ```none\n f(p(x)) = f( E_{q(Z | x)}[ p(x, Z) / q(Z | x) ] )\n <= E_{q(Z | x)}[ f( p(x, Z) / q(Z | x) ) ]\n := D_f[p(x, Z), q(Z | x)]\n ```\n\n The inequality follows from the fact that the \"perspective\" of `f`, i.e.,\n `(s, t) |-> t f(s / t))`, is convex in `(s, t)` when `s/t in domain(f)` and\n `t` is a real. 
Since the above framework includes the popular Evidence Lower\n BOund (ELBO) as a special case, i.e., `f(u) = -log(u)`, we call this framework\n \"Evidence Divergence Bound Optimization\" (EDBO).\n\n Args:\n f: Python `callable` representing a Csiszar-function in log-space, i.e.,\n takes `p_log_prob(q_samples) - q.log_prob(q_samples)`.\n p_log_prob: Python `callable` taking (a batch of) samples from `q` and\n returning the natural-log of the probability under distribution `p`.\n (In variational inference `p` is the joint distribution.)\n q: `tf.Distribution`-like instance; must implement:\n `reparameterization_type`, `sample(n, seed)`, and `log_prob(x)`.\n (In variational inference `q` is the approximate posterior distribution.)\n num_draws: Integer scalar number of draws used to approximate the\n f-Divergence expectation.\n use_reparametrization: Python `bool`. When `None` (the default),\n automatically set to:\n `q.reparameterization_type == tfd.FULLY_REPARAMETERIZED`.\n When `True` uses the standard Monte-Carlo average. When `False` uses the\n score-gradient trick. (See above for details.) When `False`, consider\n using `csiszar_vimco`.\n seed: Python `int` seed for `q.sample`.\n name: Python `str` name prefixed to Ops created by this function.\n\n Returns:\n monte_carlo_csiszar_f_divergence: `float`-like `Tensor` Monte Carlo\n approximation of the Csiszar f-Divergence.\n\n Raises:\n ValueError: if `q` is not a reparameterized distribution and\n `use_reparametrization = True`. A distribution `q` is said to be\n \"reparameterized\" when its samples are generated by transforming the\n samples of another distribution which does not depend on the\n parameterization of `q`. 
This property ensures the gradient (with respect\n to parameters) is valid.\n TypeError: if `p_log_prob` is not a Python `callable`.\n \"\"\"\n reparameterization_types = tf.nest.flatten(q.reparameterization_type)\n with tf.compat.v1.name_scope(name, \"monte_carlo_csiszar_f_divergence\",\n [num_draws]):\n if use_reparametrization is None:\n use_reparametrization = all(\n reparameterization_type == tfd.FULLY_REPARAMETERIZED\n for reparameterization_type in reparameterization_types)\n elif (use_reparametrization and\n any(reparameterization_type != tfd.FULLY_REPARAMETERIZED\n for reparameterization_type in reparameterization_types)):\n # TODO(jvdillon): Consider only raising an exception if the gradient is\n # requested.\n raise ValueError(\n \"Distribution `q` must be reparameterized, i.e., a diffeomorphic \"\n \"transformation of a parameterless distribution. (Otherwise this \"\n \"function has a biased gradient.)\")\n if not callable(p_log_prob):\n raise TypeError(\"`p_log_prob` must be a Python `callable` function.\")\n return monte_carlo.expectation(\n f=lambda q_samples: f(p_log_prob(q_samples) - q.log_prob(q_samples)),\n samples=q.sample(num_draws, seed=seed),\n log_prob=q.log_prob, # Only used if use_reparametrization=False.\n use_reparametrization=use_reparametrization)", "response": "Monte-Carlo approximation of the Csiszar f-Divergence."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef csiszar_vimco(f,\n p_log_prob,\n q,\n num_draws,\n num_batch_draws=1,\n seed=None,\n name=None):\n \"\"\"Use VIMCO to lower the variance of gradient[csiszar_function(Avg(logu))].\n\n This function generalizes VIMCO [(Mnih and Rezende, 2016)][1] to Csiszar\n f-Divergences.\n\n Note: if `q.reparameterization_type = tfd.FULLY_REPARAMETERIZED`,\n consider using `monte_carlo_csiszar_f_divergence`.\n\n The VIMCO loss is:\n\n ```none\n vimco = f(Avg{logu[i] : i=0,...,m-1})\n where,\n logu[i] = log( p(x, 
h[i]) / q(h[i] | x) )\n h[i] iid~ q(H | x)\n ```\n\n Interestingly, the VIMCO gradient is not the naive gradient of `vimco`.\n Rather, it is characterized by:\n\n ```none\n grad[vimco] - variance_reducing_term\n where,\n variance_reducing_term = Sum{ grad[log q(h[i] | x)] *\n (vimco - f(log Avg{h[j;i] : j=0,...,m-1}))\n : i=0, ..., m-1 }\n h[j;i] = { u[j] j!=i\n { GeometricAverage{ u[k] : k!=i} j==i\n ```\n\n (We omitted `stop_gradient` for brevity. See implementation for more details.)\n\n The `Avg{h[j;i] : j}` term is a kind of \"swap-out average\" where the `i`-th\n element has been replaced by the leave-`i`-out Geometric-average.\n\n This implementation prefers numerical precision over efficiency, i.e.,\n `O(num_draws * num_batch_draws * prod(batch_shape) * prod(event_shape))`.\n (The constant may be fairly large, perhaps around 12.)\n\n Args:\n f: Python `callable` representing a Csiszar-function in log-space.\n p_log_prob: Python `callable` representing the natural-log of the\n probability under distribution `p`. (In variational inference `p` is the\n joint distribution.)\n q: `tf.Distribution`-like instance; must implement: `sample(n, seed)`, and\n `log_prob(x)`. (In variational inference `q` is the approximate posterior\n distribution.)\n num_draws: Integer scalar number of draws used to approximate the\n f-Divergence expectation.\n num_batch_draws: Integer scalar number of draws used to approximate the\n f-Divergence expectation.\n seed: Python `int` seed for `q.sample`.\n name: Python `str` name prefixed to Ops created by this function.\n\n Returns:\n vimco: The Csiszar f-Divergence generalized VIMCO objective.\n\n Raises:\n ValueError: if `num_draws < 2`.\n\n #### References\n\n [1]: Andriy Mnih and Danilo Rezende. Variational Inference for Monte Carlo\n objectives. 
In _International Conference on Machine Learning_, 2016.\n https://arxiv.org/abs/1602.06725\n \"\"\"\n with tf.compat.v1.name_scope(name, \"csiszar_vimco\",\n [num_draws, num_batch_draws]):\n if num_draws < 2:\n raise ValueError(\"Must specify num_draws > 1.\")\n stop = tf.stop_gradient # For readability.\n x = stop(q.sample(sample_shape=[num_draws, num_batch_draws],\n seed=seed))\n logqx = q.log_prob(x)\n logu = p_log_prob(x) - logqx\n f_log_avg_u, f_log_sooavg_u = [f(r) for r in csiszar_vimco_helper(logu)]\n dotprod = tf.reduce_sum(\n input_tensor=logqx * stop(f_log_avg_u - f_log_sooavg_u),\n axis=0) # Sum over iid samples.\n # We now rewrite f_log_avg_u so that:\n # `grad[f_log_avg_u] := grad[f_log_avg_u + dotprod]`.\n # To achieve this, we use a trick that\n # `f(x) - stop(f(x)) == zeros_like(f(x))`\n # but its gradient is grad[f(x)].\n # Note that IEEE754 specifies that `x - x == 0.` and `x + 0. == x`, hence\n # this trick loses no precision. For more discussion regarding the relevant\n # portions of the IEEE754 standard, see the StackOverflow question,\n # \"Is there a floating point value of x, for which x-x == 0 is false?\"\n # http://stackoverflow.com/q/2686644\n f_log_avg_u += dotprod - stop(dotprod) # Add zeros_like(dot_prod).\n return tf.reduce_mean(input_tensor=f_log_avg_u, axis=0)", "response": "This function is used to lower the variance of gradient of vimco."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef csiszar_vimco_helper(logu, name=None):\n with tf.compat.v1.name_scope(name, \"csiszar_vimco_helper\", [logu]):\n logu = tf.convert_to_tensor(value=logu, name=\"logu\")\n\n n = tf.compat.dimension_value(logu.shape.with_rank_at_least(1)[0])\n if n is None:\n n = tf.shape(input=logu)[0]\n log_n = tf.math.log(tf.cast(n, dtype=logu.dtype))\n nm1 = tf.cast(n - 1, dtype=logu.dtype)\n else:\n log_n = np.log(n).astype(logu.dtype.as_numpy_dtype)\n nm1 = np.asarray(n - 1, 
dtype=logu.dtype.as_numpy_dtype)\n\n # Throughout we reduce across axis=0 since this is presumed to be iid\n # samples.\n\n log_max_u = tf.reduce_max(input_tensor=logu, axis=0)\n log_sum_u_minus_log_max_u = tf.reduce_logsumexp(\n input_tensor=logu - log_max_u, axis=0)\n\n # log_loosum_u[i] =\n # = logsumexp(logu[j] : j != i)\n # = log( exp(logsumexp(logu)) - exp(logu[i]) )\n # = log( exp(logsumexp(logu - logu[i])) exp(logu[i]) - exp(logu[i]))\n # = logu[i] + log(exp(logsumexp(logu - logu[i])) - 1)\n # = logu[i] + log(exp(logsumexp(logu) - logu[i]) - 1)\n # = logu[i] + softplus_inverse(logsumexp(logu) - logu[i])\n d = log_sum_u_minus_log_max_u + (log_max_u - logu)\n # We use `d != 0` rather than `d > 0.` because `d < 0.` should never\n # happens; if it does we want to complain loudly (which `softplus_inverse`\n # will).\n d_ok = tf.not_equal(d, 0.)\n safe_d = tf.where(d_ok, d, tf.ones_like(d))\n d_ok_result = logu + tfd.softplus_inverse(safe_d)\n\n inf = np.array(np.inf, dtype=logu.dtype.as_numpy_dtype)\n\n # When not(d_ok) and is_positive_and_largest then we manually compute the\n # log_loosum_u. (We can efficiently do this for any one point but not all,\n # hence we still need the above calculation.) 
This is good because when\n # this condition is met, we cannot use the above calculation; its -inf.\n # We now compute the log-leave-out-max-sum, replicate it to every\n # point and make sure to select it only when we need to.\n is_positive_and_largest = tf.logical_and(\n logu > 0.,\n tf.equal(logu, log_max_u[tf.newaxis, ...]))\n log_lomsum_u = tf.reduce_logsumexp(\n input_tensor=tf.where(is_positive_and_largest,\n tf.fill(tf.shape(input=logu), -inf), logu),\n axis=0,\n keepdims=True)\n log_lomsum_u = tf.tile(\n log_lomsum_u,\n multiples=1 + tf.pad(tensor=[n - 1], paddings=[[0, tf.rank(logu) - 1]]))\n\n d_not_ok_result = tf.where(is_positive_and_largest, log_lomsum_u,\n tf.fill(tf.shape(input=d), -inf))\n\n log_loosum_u = tf.where(d_ok, d_ok_result, d_not_ok_result)\n\n # The swap-one-out-sum (\"soosum\") is n different sums, each of which\n # replaces the i-th item with the i-th-left-out average, i.e.,\n # soo_sum_u[i] = [exp(logu) - exp(logu[i])] + exp(mean(logu[!=i]))\n # = exp(log_loosum_u[i]) + exp(looavg_logu[i])\n looavg_logu = (tf.reduce_sum(input_tensor=logu, axis=0) - logu) / nm1\n log_soosum_u = tf.reduce_logsumexp(\n input_tensor=tf.stack([log_loosum_u, looavg_logu]), axis=0)\n\n log_avg_u = log_sum_u_minus_log_max_u + log_max_u - log_n\n log_sooavg_u = log_soosum_u - log_n\n\n log_avg_u.set_shape(logu.shape.with_rank_at_least(1)[1:])\n log_sooavg_u.set_shape(logu.shape)\n\n return log_avg_u, log_sooavg_u", "response": "Helper to `csiszar_vimco`; computes `log_avg_u`, `log_sooavg_u`.\n\n `axis = 0` of `logu` is presumed to correspond to iid samples from `q`, i.e.,\n\n ```none\n logu[j] = log(u[j])\n u[j] = p(x, h[j]) / q(h[j] | x)\n h[j] iid~ q(H | x)\n ```\n\n Args:\n logu: Floating-type `Tensor` representing `log(p(x, h) / q(h | x))`.\n name: Python `str` name prefixed to Ops created by this function.\n\n Returns:\n log_avg_u: `logu.dtype` `Tensor` corresponding to the natural-log of the\n average of `u`. 
The sum of the gradient of `log_avg_u` is `1`.\n log_sooavg_u: `logu.dtype` `Tensor` characterized by the natural-log of the\n average of `u`` except that the average swaps-out `u[i]` for the\n leave-`i`-out Geometric-average. The mean of the gradient of\n `log_sooavg_u` is `1`. Mathematically `log_sooavg_u` is,\n ```none\n log_sooavg_u[i] = log(Avg{h[j ; i] : j=0, ..., m-1})\n h[j ; i] = { u[j] j!=i\n { GeometricAverage{u[k] : k != i} j==i\n ```"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef interp_regular_1d_grid(x,\n x_ref_min,\n x_ref_max,\n y_ref,\n axis=-1,\n fill_value='constant_extension',\n fill_value_below=None,\n fill_value_above=None,\n grid_regularizing_transform=None,\n name=None):\n \"\"\"Linear `1-D` interpolation on a regular (constant spacing) grid.\n\n Given reference values, this function computes a piecewise linear interpolant\n and evaluates it on a new set of `x` values.\n\n The interpolant is built from `C` reference values indexed by one dimension\n of `y_ref` (specified by the `axis` kwarg).\n\n If `y_ref` is a vector, then each value `y_ref[i]` is considered to be equal\n to `f(x_ref[i])`, for `C` (implicitly defined) reference values between\n `x_ref_min` and `x_ref_max`:\n\n ```none\n x_ref[i] = x_ref_min + i * (x_ref_max - x_ref_min) / (C - 1),\n i = 0, ..., C - 1.\n ```\n\n If `rank(y_ref) > 1`, then dimension `axis` indexes `C` reference values of\n a shape `y_ref.shape[:axis] + y_ref.shape[axis + 1:]` `Tensor`.\n\n If `rank(x) > 1`, then the output is obtained by effectively flattening `x`,\n interpolating along `axis`, then expanding the result to shape\n `y_ref.shape[:axis] + x.shape + y_ref.shape[axis + 1:]`.\n\n These shape semantics are equivalent to `scipy.interpolate.interp1d`.\n\n Args:\n x: Numeric `Tensor` The x-coordinates of the interpolated output values.\n x_ref_min: Scalar `Tensor` of same `dtype` as `x`. 
The minimum value of\n the (implicitly defined) reference `x_ref`.\n x_ref_max: Scalar `Tensor` of same `dtype` as `x`. The maximum value of\n the (implicitly defined) reference `x_ref`.\n y_ref: `N-D` `Tensor` (`N > 0`) of same `dtype` as `x`. The reference\n output values.\n axis: Scalar `Tensor` designating the dimension of `y_ref` that indexes\n values of the interpolation table.\n Default value: `-1`, the rightmost axis.\n fill_value: Determines what values output should take for `x` values that\n are below `x_ref_min` or above `x_ref_max`. `Tensor` or one of the strings\n \"constant_extension\" ==> Extend as constant function. \"extrapolate\" ==>\n Extrapolate in a linear fashion.\n Default value: `\"constant_extension\"`\n fill_value_below: Optional override of `fill_value` for `x < x_ref_min`.\n fill_value_above: Optional override of `fill_value` for `x > x_ref_max`.\n grid_regularizing_transform: Optional transformation `g` which regularizes\n the implied spacing of the x reference points. 
In other words, if\n provided, we assume `g(x_ref_i)` is a regular grid between `g(x_ref_min)`\n and `g(x_ref_max)`.\n name: A name to prepend to created ops.\n Default value: `\"interp_regular_1d_grid\"`.\n\n Returns:\n y_interp: Interpolation between members of `y_ref`, at points `x`.\n `Tensor` of same `dtype` as `x`, and shape\n `y.shape[:axis] + x.shape + y.shape[axis + 1:]`\n\n Raises:\n ValueError: If `fill_value` is not an allowed string.\n ValueError: If `axis` is not a scalar.\n\n #### Examples\n\n Interpolate a function of one variable:\n\n ```python\n y_ref = tf.exp(tf.linspace(start=0., stop=10., num=200))\n\n interp_regular_1d_grid(\n x=[6.0, 0.5, 3.3], x_ref_min=0., x_ref_max=10., y_ref=y_ref)\n ==> approx [exp(6.0), exp(0.5), exp(3.3)]\n ```\n\n Interpolate a matrix-valued function of one variable:\n\n ```python\n mat_0 = [[1., 0.], [0., 1.]]\n mat_1 = [[0., -1], [1, 0]]\n y_ref = [mat_0, mat_1]\n\n # Get three output matrices at once.\n tfp.math.interp_regular_1d_grid(\n x=[0., 0.5, 1.], x_ref_min=0., x_ref_max=1., y_ref=y_ref, axis=0)\n ==> [mat_0, 0.5 * mat_0 + 0.5 * mat_1, mat_1]\n ```\n\n Interpolate a scalar valued function, and get a matrix of results:\n\n ```python\n y_ref = tf.exp(tf.linspace(start=0., stop=10., num=200))\n x = [[1.1, 1.2], [2.1, 2.2]]\n tfp.math.interp_regular_1d_grid(x, x_ref_min=0., x_ref_max=10., y_ref=y_ref)\n ==> tf.exp(x)\n ```\n\n Interpolate a function of one variable on a log-spaced grid:\n\n ```python\n x_ref = tf.exp(tf.linspace(tf.log(1.), tf.log(100000.), num_pts))\n y_ref = tf.log(x_ref + x_ref**2)\n\n interp_regular_1d_grid(x=[1.1, 2.2], x_ref_min=1., x_ref_max=100000., y_ref=y_ref,\n grid_regularizing_transform=tf.log)\n ==> [tf.log(1.1 + 1.1**2), tf.log(2.2 + 2.2**2)]\n ```\n\n \"\"\"\n\n return _interp_regular_1d_grid_impl(\n x,\n x_ref_min,\n x_ref_max,\n y_ref,\n axis=axis,\n batch_y_ref=False,\n fill_value=fill_value,\n fill_value_below=fill_value_below,\n fill_value_above=fill_value_above,\n 
grid_regularizing_transform=grid_regularizing_transform,\n name=name or 'interp_regular_1d_grid')", "response": "Linear 1-D interpolation on regular grids."} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef batch_interp_regular_1d_grid(x,\n x_ref_min,\n x_ref_max,\n y_ref,\n axis=-1,\n fill_value='constant_extension',\n fill_value_below=None,\n fill_value_above=None,\n grid_regularizing_transform=None,\n name=None):\n \"\"\"Linear `1-D` interpolation on a regular (constant spacing) grid.\n\n Given [batch of] reference values, this function computes a piecewise linear\n interpolant and evaluates it on a [batch of] new `x` values.\n\n The interpolant is built from `C` reference values indexed by one dimension\n of `y_ref` (specified by the `axis` kwarg).\n\n If `y_ref` is a vector, then each value `y_ref[i]` is considered to be equal\n to `f(x_ref[i])`, for `C` (implicitly defined) reference values between\n `x_ref_min` and `x_ref_max`:\n\n ```none\n x_ref[i] = x_ref_min + i * (x_ref_max - x_ref_min) / (C - 1),\n i = 0, ..., C - 1.\n ```\n\n In the general case, dimensions to the left of `axis` in `y_ref` are broadcast\n with leading dimensions in `x`, `x_ref_min`, `x_ref_max`.\n\n Args:\n x: Numeric `Tensor`. The x-coordinates of the interpolated output values for\n each batch. Shape broadcasts with `[A1, ..., AN, D]`, `N >= 0`.\n x_ref_min: `Tensor` of same `dtype` as `x`. The minimum value of each\n batch of the (implicitly defined) reference `x_ref`. Shape broadcasts with\n `[A1, ..., AN]`, `N >= 0`.\n x_ref_max: `Tensor` of same `dtype` as `x`. The maximum value of each\n batch of the (implicitly defined) reference `x_ref`. Shape broadcasts with\n `[A1, ..., AN]`, `N >= 0`.\n y_ref: `Tensor` of same `dtype` as `x`. 
The reference output values.\n `y_ref.shape[:axis]` broadcasts with the batch shape `[A1, ..., AN]`, and\n `y_ref.shape[axis:]` is `[C, B1, ..., BM]`, so the trailing dimensions\n index `C` reference values of a rank `M` `Tensor` (`M >= 0`).\n axis: Scalar `Tensor` designating the dimension of `y_ref` that indexes\n values of the interpolation table.\n Default value: `-1`, the rightmost axis.\n fill_value: Determines what values output should take for `x` values that\n are below `x_ref_min` or above `x_ref_max`. `Tensor` or one of the strings\n \"constant_extension\" ==> Extend as constant function. \"extrapolate\" ==>\n Extrapolate in a linear fashion.\n Default value: `\"constant_extension\"`\n fill_value_below: Optional override of `fill_value` for `x < x_ref_min`.\n fill_value_above: Optional override of `fill_value` for `x > x_ref_max`.\n grid_regularizing_transform: Optional transformation `g` which regularizes\n the implied spacing of the x reference points. In other words, if\n provided, we assume `g(x_ref_i)` is a regular grid between `g(x_ref_min)`\n and `g(x_ref_max)`.\n name: A name to prepend to created ops.\n Default value: `\"batch_interp_regular_1d_grid\"`.\n\n Returns:\n y_interp: Interpolation between members of `y_ref`, at points `x`.\n `Tensor` of same `dtype` as `x`, and shape `[A1, ..., AN, D, B1, ..., BM]`\n\n Raises:\n ValueError: If `fill_value` is not an allowed string.\n ValueError: If `axis` is not a scalar.\n\n #### Examples\n\n Interpolate a function of one variable:\n\n ```python\n y_ref = tf.exp(tf.linspace(start=0., stop=10., num=20))\n\n batch_interp_regular_1d_grid(\n x=[6.0, 0.5, 3.3], x_ref_min=0., x_ref_max=10., y_ref=y_ref)\n ==> approx [exp(6.0), exp(0.5), exp(3.3)]\n ```\n\n Interpolate a batch of functions of one variable.\n\n ```python\n # First batch member is an exponential function, second is a log.\n implied_x_ref = [tf.linspace(-3., 3.2, 200), tf.linspace(0.5, 3., 200)]\n y_ref = tf.stack( # Shape [2, 200], 2 batches, 200 
reference values per batch\n [tf.exp(implied_x_ref[0]), tf.log(implied_x_ref[1])], axis=0)\n\n x = [[-1., 1., 0.], # Shape [2, 3], 2 batches, 3 values per batch.\n [1., 2., 3.]]\n\n y = tfp.math.batch_interp_regular_1d_grid( # Shape [2, 3]\n x,\n x_ref_min=[-3., 0.5],\n x_ref_max=[3.2, 3.],\n y_ref=y_ref,\n axis=-1)\n\n # y[0] approx tf.exp(x[0])\n # y[1] approx tf.log(x[1])\n ```\n\n Interpolate a function of one variable on a log-spaced grid:\n\n ```python\n x_ref = tf.exp(tf.linspace(tf.log(1.), tf.log(100000.), num_pts))\n y_ref = tf.log(x_ref + x_ref**2)\n\n batch_interp_regular_1d_grid(x=[1.1, 2.2], x_ref_min=1., x_ref_max=100000.,\n y_ref=y_ref, grid_regularizing_transform=tf.log)\n ==> [tf.log(1.1 + 1.1**2), tf.log(2.2 + 2.2**2)]\n ```\n\n \"\"\"\n\n return _interp_regular_1d_grid_impl(\n x,\n x_ref_min,\n x_ref_max,\n y_ref,\n axis=axis,\n batch_y_ref=True,\n fill_value=fill_value,\n fill_value_below=fill_value_below,\n fill_value_above=fill_value_above,\n grid_regularizing_transform=grid_regularizing_transform,\n name=name or 'batch_interp_regular_1d_grid')", "response": "This function computes a piecewise linear interpolation on a regular 1-D grid."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef batch_interp_regular_nd_grid(x,\n x_ref_min,\n x_ref_max,\n y_ref,\n axis,\n fill_value='constant_extension',\n name=None):\n \"\"\"Multi-linear interpolation on a regular (constant spacing) grid.\n\n Given [a batch of] reference values, this function computes a multi-linear\n interpolant and evaluates it on [a batch of] new `x` values.\n\n The interpolant is built from reference values indexed by `nd` dimensions\n of `y_ref`, starting at `axis`.\n\n For example, take the case of a `2-D` scalar valued function and no leading\n batch dimensions. 
In this case, `y_ref.shape = [C1, C2]` and `y_ref[i, j]`\n is the reference value corresponding to grid point\n\n ```\n [x_ref_min[0] + i * (x_ref_max[0] - x_ref_min[0]) / (C1 - 1),\n x_ref_min[1] + j * (x_ref_max[1] - x_ref_min[1]) / (C2 - 1)]\n ```\n\n In the general case, dimensions to the left of `axis` in `y_ref` are broadcast\n with leading dimensions in `x`, `x_ref_min`, `x_ref_max`.\n\n Args:\n x: Numeric `Tensor` The x-coordinates of the interpolated output values for\n each batch. Shape `[..., D, nd]`, designating [a batch of] `D`\n coordinates in `nd` space. `D` must be `>= 1` and is not a batch dim.\n x_ref_min: `Tensor` of same `dtype` as `x`. The minimum values of the\n (implicitly defined) reference `x_ref`. Shape `[..., nd]`.\n x_ref_max: `Tensor` of same `dtype` as `x`. The maximum values of the\n (implicitly defined) reference `x_ref`. Shape `[..., nd]`.\n y_ref: `Tensor` of same `dtype` as `x`. The reference output values. Shape\n `[..., C1, ..., Cnd, B1,...,BM]`, designating [a batch of] reference\n values indexed by `nd` dimensions, of a shape `[B1,...,BM]` valued\n function (for `M >= 0`).\n axis: Scalar integer `Tensor`. Dimensions `[axis, axis + nd)` of `y_ref`\n index the interpolation table. E.g. `3-D` interpolation of a scalar\n valued function requires `axis=-3` and a `3-D` matrix valued function\n requires `axis=-5`.\n fill_value: Determines what values output should take for `x` values that\n are below `x_ref_min` or above `x_ref_max`. 
Scalar `Tensor` or\n \"constant_extension\" ==> Extend as constant function.\n Default value: `\"constant_extension\"`\n name: A name to prepend to created ops.\n Default value: `\"batch_interp_regular_nd_grid\"`.\n\n Returns:\n y_interp: Interpolation between members of `y_ref`, at points `x`.\n `Tensor` of same `dtype` as `x`, and shape `[..., D, B1, ..., BM].`\n\n Raises:\n ValueError: If `rank(x) < 2` is determined statically.\n ValueError: If `axis` is not a scalar is determined statically.\n ValueError: If `axis + nd > rank(y_ref)` is determined statically.\n\n #### Examples\n\n Interpolate a function of one variable.\n\n ```python\n y_ref = tf.exp(tf.linspace(start=0., stop=10., num=20))\n\n tfp.math.batch_interp_regular_nd_grid(\n # x.shape = [3, 1], x_ref_min/max.shape = [1]. Trailing `1` for `1-D`.\n x=[[6.0], [0.5], [3.3]], x_ref_min=[0.], x_ref_max=[10.], y_ref=y_ref,\n axis=-1)\n ==> approx [exp(6.0), exp(0.5), exp(3.3)]\n ```\n\n Interpolate a scalar function of two variables.\n\n ```python\n x_ref_min = [0., 0.]\n x_ref_max = [2 * np.pi, 2 * np.pi]\n\n # Build y_ref.\n x0s, x1s = tf.meshgrid(\n tf.linspace(x_ref_min[0], x_ref_max[0], num=100),\n tf.linspace(x_ref_min[1], x_ref_max[1], num=100),\n indexing='ij')\n\n def func(x0, x1):\n return tf.sin(x0) * tf.cos(x1)\n\n y_ref = func(x0s, x1s)\n\n x = np.pi * tf.random_uniform(shape=(10, 2))\n\n tfp.math.batch_interp_regular_nd_grid(x, x_ref_min, x_ref_max, y_ref, axis=-2)\n ==> tf.sin(x[:, 0]) * tf.cos(x[:, 1])\n ```\n\n \"\"\"\n with tf.compat.v1.name_scope(\n name,\n default_name='interp_regular_nd_grid',\n values=[x, x_ref_min, x_ref_max, y_ref, fill_value]):\n dtype = dtype_util.common_dtype([x, x_ref_min, x_ref_max, y_ref],\n preferred_dtype=tf.float32)\n\n # Arg checking.\n if isinstance(fill_value, str):\n if fill_value != 'constant_extension':\n raise ValueError(\n 'A fill value ({}) was not an allowed string ({})'.format(\n fill_value, 'constant_extension'))\n else:\n fill_value = tf.convert_to_tensor(\n 
value=fill_value, name='fill_value', dtype=dtype)\n _assert_ndims_statically(fill_value, expect_ndims=0)\n\n # x.shape = [..., nd].\n x = tf.convert_to_tensor(value=x, name='x', dtype=dtype)\n _assert_ndims_statically(x, expect_ndims_at_least=2)\n\n # y_ref.shape = [..., C1,...,Cnd, B1,...,BM]\n y_ref = tf.convert_to_tensor(value=y_ref, name='y_ref', dtype=dtype)\n\n # x_ref_min.shape = [nd]\n x_ref_min = tf.convert_to_tensor(\n value=x_ref_min, name='x_ref_min', dtype=dtype)\n x_ref_max = tf.convert_to_tensor(\n value=x_ref_max, name='x_ref_max', dtype=dtype)\n _assert_ndims_statically(\n x_ref_min, expect_ndims_at_least=1, expect_static=True)\n _assert_ndims_statically(\n x_ref_max, expect_ndims_at_least=1, expect_static=True)\n\n # nd is the number of dimensions indexing the interpolation table, it's the\n # \"nd\" in the function name.\n nd = tf.compat.dimension_value(x_ref_min.shape[-1])\n if nd is None:\n raise ValueError('`x_ref_min.shape[-1]` must be known statically.')\n x_ref_max.shape[-1:].assert_is_compatible_with(x_ref_min.shape[-1:])\n\n # Convert axis and check it statically.\n axis = tf.convert_to_tensor(value=axis, dtype=tf.int32, name='axis')\n axis = distribution_util.make_non_negative_axis(axis, tf.rank(y_ref))\n axis.shape.assert_has_rank(0)\n axis_ = tf.get_static_value(axis)\n y_ref_rank_ = tf.get_static_value(tf.rank(y_ref))\n if axis_ is not None and y_ref_rank_ is not None:\n if axis_ + nd > y_ref_rank_:\n raise ValueError(\n 'Since dims `[axis, axis + nd)` index the interpolation table, we '\n 'must have `axis + nd <= rank(y_ref)`. 
Found: '\n '`axis`: {}, rank(y_ref): {}, and inferred `nd` from trailing '\n 'dimensions of `x_ref_min` to be {}.'.format(\n axis_, y_ref_rank_, nd))\n\n x_batch_shape = tf.shape(input=x)[:-2]\n x_ref_min_batch_shape = tf.shape(input=x_ref_min)[:-1]\n x_ref_max_batch_shape = tf.shape(input=x_ref_max)[:-1]\n y_ref_batch_shape = tf.shape(input=y_ref)[:axis]\n\n # Do a brute-force broadcast of batch dims (add zeros).\n batch_shape = y_ref_batch_shape\n for tensor in [x_batch_shape, x_ref_min_batch_shape, x_ref_max_batch_shape]:\n batch_shape = tf.broadcast_dynamic_shape(batch_shape, tensor)\n\n def _batch_of_zeros_with_rightmost_singletons(n_singletons):\n \"\"\"Return Tensor of zeros with some singletons on the rightmost dims.\"\"\"\n ones = tf.ones(shape=[n_singletons], dtype=tf.int32)\n return tf.zeros(shape=tf.concat([batch_shape, ones], axis=0), dtype=dtype)\n\n x += _batch_of_zeros_with_rightmost_singletons(n_singletons=2)\n x_ref_min += _batch_of_zeros_with_rightmost_singletons(n_singletons=1)\n x_ref_max += _batch_of_zeros_with_rightmost_singletons(n_singletons=1)\n y_ref += _batch_of_zeros_with_rightmost_singletons(\n n_singletons=tf.rank(y_ref) - axis)\n\n return _batch_interp_with_gather_nd(\n x=x,\n x_ref_min=x_ref_min,\n x_ref_max=x_ref_max,\n y_ref=y_ref,\n nd=nd,\n fill_value=fill_value,\n batch_dims=tf.get_static_value(tf.rank(x)) - 2)", "response": "This function computes a multi - linear interpolation on a regular ND grid."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _assert_ndims_statically(x,\n expect_ndims=None,\n expect_ndims_at_least=None,\n expect_static=False):\n \"\"\"Assert that Tensor x has expected number of dimensions.\"\"\"\n ndims = x.shape.ndims\n if ndims is None:\n if expect_static:\n raise ValueError('Expected static ndims. Found: {}'.format(x))\n return\n if expect_ndims is not None and ndims != expect_ndims:\n raise ValueError('ndims must be {}. 
Found: {}'.format(expect_ndims, ndims))\n if expect_ndims_at_least is not None and ndims < expect_ndims_at_least:\n raise ValueError('ndims must be at least {}. Found {}'.format(\n expect_ndims_at_least, ndims))", "response": "Assert that Tensor x has expected number of dimensions."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef _make_expand_x_fn_for_non_batch_interpolation(y_ref, axis):\n # This expansion is to help x broadcast with `y`, the output.\n # In the non-batch case, the output shape is going to be\n # y_ref.shape[:axis] + x.shape + y_ref.shape[axis+1:]\n\n # Recall we made axis non-negative\n y_ref_shape = tf.shape(input=y_ref)\n y_ref_shape_left = y_ref_shape[:axis]\n y_ref_shape_right = y_ref_shape[axis + 1:]\n\n def expand_ends(x, broadcast=False):\n \"\"\"Expand x so it can bcast w/ tensors of output shape.\"\"\"\n # Assume out_shape = A + x.shape + B, and rank(A) = axis.\n # Expand with singletons with same rank as A, B.\n expanded_shape = tf.pad(\n tensor=tf.shape(input=x),\n paddings=[[axis, tf.size(input=y_ref_shape_right)]],\n constant_values=1)\n x_expanded = tf.reshape(x, expanded_shape)\n\n if broadcast:\n out_shape = tf.concat((\n y_ref_shape_left,\n tf.shape(input=x),\n y_ref_shape_right,\n ),\n axis=0)\n if x.dtype.is_bool:\n x_expanded = x_expanded | tf.cast(tf.zeros(out_shape), tf.bool)\n else:\n x_expanded += tf.zeros(out_shape, dtype=x.dtype)\n return x_expanded\n\n return expand_ends", "response": "Make function to expand left and right dims of tensors shaped like x."} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\nmake function to expand left and right dimensions of tensors of x.", "response": "def _make_expand_x_fn_for_batch_interpolation(y_ref, axis):\n \"\"\"Make func to expand left/right (of axis) dims of tensors shaped like x.\"\"\"\n # This expansion is to help x broadcast with `y`, the output.\n # In the batch 
case, the output shape is going to be\n # Broadcast(y_ref.shape[:axis], x.shape[:-1]) +\n # x.shape[-1:] + y_ref.shape[axis+1:]\n\n # Recall we made axis non-negative\n y_ref_shape = tf.shape(input=y_ref)\n y_ref_shape_left = y_ref_shape[:axis]\n y_ref_shape_right = y_ref_shape[axis + 1:]\n\n def expand_right_dims(x, broadcast=False):\n \"\"\"Expand x so it can bcast w/ tensors of output shape.\"\"\"\n expanded_shape_left = tf.broadcast_dynamic_shape(\n tf.shape(input=x)[:-1],\n tf.ones([tf.size(input=y_ref_shape_left)], dtype=tf.int32))\n expanded_shape = tf.concat(\n (expanded_shape_left, tf.shape(input=x)[-1:],\n tf.ones([tf.size(input=y_ref_shape_right)], dtype=tf.int32)),\n axis=0)\n x_expanded = tf.reshape(x, expanded_shape)\n if broadcast:\n broadcast_shape_left = tf.broadcast_dynamic_shape(\n tf.shape(input=x)[:-1], y_ref_shape_left)\n broadcast_shape = tf.concat(\n (broadcast_shape_left, tf.shape(input=x)[-1:], y_ref_shape_right),\n axis=0)\n if x.dtype.is_bool:\n x_expanded = x_expanded | tf.cast(tf.zeros(broadcast_shape), tf.bool)\n else:\n x_expanded += tf.zeros(broadcast_shape, dtype=x.dtype)\n return x_expanded\n\n return expand_right_dims"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\nliking batch_gather but broadcasts to the left of axis.", "response": "def _batch_gather_with_broadcast(params, indices, axis):\n \"\"\"Like batch_gather, but broadcasts to the left of axis.\"\"\"\n # batch_gather assumes...\n # params.shape = [A1,...,AN, B1,...,BM]\n # indices.shape = [A1,...,AN, C]\n # which gives output of shape\n # [A1,...,AN, C, B1,...,BM]\n # Here we broadcast dims of each to the left of `axis` in params, and left of\n # the rightmost dim in indices, e.g. 
we can\n # have\n # params.shape = [A1,...,AN, B1,...,BM]\n # indices.shape = [a1,...,aN, C],\n # where ai broadcasts with Ai.\n\n # leading_bcast_shape is the broadcast of [A1,...,AN] and [a1,...,aN].\n leading_bcast_shape = tf.broadcast_dynamic_shape(\n tf.shape(input=params)[:axis],\n tf.shape(input=indices)[:-1])\n params += tf.zeros(\n tf.concat((leading_bcast_shape, tf.shape(input=params)[axis:]), axis=0),\n dtype=params.dtype)\n indices += tf.zeros(\n tf.concat((leading_bcast_shape, tf.shape(input=indices)[-1:]), axis=0),\n dtype=indices.dtype)\n return tf.compat.v1.batch_gather(params, indices)"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _broadcast_cat_event_and_params(event, params, base_dtype):\n if dtype_util.is_integer(event.dtype):\n pass\n elif dtype_util.is_floating(event.dtype):\n # When `validate_args=True` we've already ensured int/float casting\n # is closed.\n event = tf.cast(event, dtype=tf.int32)\n else:\n raise TypeError(\"`value` should have integer `dtype` or \"\n \"`self.dtype` ({})\".format(base_dtype))\n shape_known_statically = (\n tensorshape_util.rank(params.shape) is not None and\n tensorshape_util.is_fully_defined(params.shape[:-1]) and\n tensorshape_util.is_fully_defined(event.shape))\n if not shape_known_statically or params.shape[:-1] != event.shape:\n params *= tf.ones_like(event[..., tf.newaxis],\n dtype=params.dtype)\n params_shape = tf.shape(input=params)[:-1]\n event *= tf.ones(params_shape, dtype=event.dtype)\n if tensorshape_util.rank(params.shape) is not None:\n tensorshape_util.set_shape(event, params.shape[:-1])\n\n return event, params", "response": "Broadcasts the event or distribution parameters."} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef expectation_importance_sampler_logspace(\n log_f,\n log_p,\n sampling_dist_q,\n z=None,\n n=None,\n seed=None,\n name='expectation_importance_sampler_logspace'):\n 
r\"\"\"Importance sampling with a positive function, in log-space.\n\n With \\\\(p(z) := exp^{log_p(z)}\\\\), and \\\\(f(z) = exp{log_f(z)}\\\\),\n this `Op` returns\n\n \\\\(Log[ n^{-1} sum_{i=1}^n [ f(z_i) p(z_i) / q(z_i) ] ], z_i ~ q,\\\\)\n \\\\(\\approx Log[ E_q[ f(Z) p(Z) / q(Z) ] ]\\\\)\n \\\\(= Log[E_p[f(Z)]]\\\\)\n\n This integral is done in log-space with max-subtraction to better handle the\n often extreme values that `f(z) p(z) / q(z)` can take on.\n\n In contrast to `expectation_importance_sampler`, this `Op` returns values in\n log-space.\n\n\n User supplies either `Tensor` of samples `z`, or number of samples to draw `n`\n\n Args:\n log_f: Callable mapping samples from `sampling_dist_q` to `Tensors` with\n shape broadcastable to `q.batch_shape`.\n For example, `log_f` works \"just like\" `sampling_dist_q.log_prob`.\n log_p: Callable mapping samples from `sampling_dist_q` to `Tensors` with\n shape broadcastable to `q.batch_shape`.\n For example, `log_p` works \"just like\" `q.log_prob`.\n sampling_dist_q: The sampling distribution.\n `tfp.distributions.Distribution`.\n `float64` `dtype` recommended.\n `log_p` and `q` should be supported on the same set.\n z: `Tensor` of samples from `q`, produced by `q.sample` for some `n`.\n n: Integer `Tensor`. Number of samples to generate if `z` is not provided.\n seed: Python integer to seed the random number generator.\n name: A name to give this `Op`.\n\n Returns:\n Logarithm of the importance sampling estimate. 
`Tensor` with `shape` equal\n to batch shape of `q`, and `dtype` = `q.dtype`.\n \"\"\"\n q = sampling_dist_q\n with tf.name_scope(name):\n z = _get_samples(q, z, n, seed)\n log_values = log_f(z) + log_p(z) - q.log_prob(z)\n return _logspace_mean(log_values)", "response": "Importance-sampling estimate of E_p[f(Z)] for a positive function f, computed in log-space."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _logspace_mean(log_values):\n # center = Max[Log[values]], with stop-gradient\n # The center hopefully keeps the exponentiated term small. It is canceled\n # from the final result, so putting stop gradient on it will not change the\n # final result. We put stop gradient on to eliminate unnecessary computation.\n center = tf.stop_gradient(_sample_max(log_values))\n\n # centered_values = exp{Log[values] - E[Log[values]]}\n centered_values = tf.math.exp(log_values - center)\n\n # log_mean_of_values = Log[ E[centered_values] ] + center\n # = Log[ E[exp{log_values - E[log_values]}] ] + center\n # = Log[E[values]] - E[log_values] + center\n # = Log[E[values]]\n log_mean_of_values = tf.math.log(_sample_mean(centered_values)) + center\n\n return log_mean_of_values", "response": "Evaluate log_values in a stable manner."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _broadcast_event_and_samples(event, samples, event_ndims):\n # This is the shape of self.samples, without the samples axis, i.e. the shape\n # of the result of a call to dist.sample(). 
This way we can broadcast it with\n # event to get a properly-sized event, then add the singleton dim back at\n # -event_ndims - 1.\n samples_shape = tf.concat(\n [tf.shape(input=samples)[:-event_ndims - 1],\n tf.shape(input=samples)[tf.rank(samples) - event_ndims:]],\n axis=0)\n event *= tf.ones(samples_shape, dtype=event.dtype)\n event = tf.expand_dims(event, axis=-event_ndims - 1)\n samples *= tf.ones_like(event, dtype=samples.dtype)\n\n return event, samples", "response": "Broadcasts the event or samples."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef one_step(self, current_state, previous_kernel_results):\n with tf.compat.v1.name_scope(\n name=mcmc_util.make_name(self.name, 'mh', 'one_step'),\n values=[current_state, previous_kernel_results]):\n # Take one inner step.\n [\n proposed_state,\n proposed_results,\n ] = self.inner_kernel.one_step(\n current_state,\n previous_kernel_results.accepted_results)\n\n if (not has_target_log_prob(proposed_results) or\n not has_target_log_prob(previous_kernel_results.accepted_results)):\n raise ValueError('\"target_log_prob\" must be a member of '\n '`inner_kernel` results.')\n\n # Compute log(acceptance_ratio).\n to_sum = [proposed_results.target_log_prob,\n -previous_kernel_results.accepted_results.target_log_prob]\n try:\n if (not mcmc_util.is_list_like(\n proposed_results.log_acceptance_correction)\n or proposed_results.log_acceptance_correction):\n to_sum.append(proposed_results.log_acceptance_correction)\n except AttributeError:\n warnings.warn('Supplied inner `TransitionKernel` does not have a '\n '`log_acceptance_correction`. 
Assuming its value is `0.`')\n log_accept_ratio = mcmc_util.safe_sum(\n to_sum, name='compute_log_accept_ratio')\n\n # If proposed state reduces likelihood: randomly accept.\n # If proposed state increases likelihood: always accept.\n # I.e., u < min(1, accept_ratio), where u ~ Uniform[0,1)\n # ==> log(u) < log_accept_ratio\n log_uniform = tf.math.log(\n tf.random.uniform(\n shape=tf.shape(input=proposed_results.target_log_prob),\n dtype=proposed_results.target_log_prob.dtype.base_dtype,\n seed=self._seed_stream()))\n is_accepted = log_uniform < log_accept_ratio\n\n next_state = mcmc_util.choose(\n is_accepted,\n proposed_state,\n current_state,\n name='choose_next_state')\n\n kernel_results = MetropolisHastingsKernelResults(\n accepted_results=mcmc_util.choose(\n is_accepted,\n proposed_results,\n previous_kernel_results.accepted_results,\n name='choose_inner_results'),\n is_accepted=is_accepted,\n log_accept_ratio=log_accept_ratio,\n proposed_state=proposed_state,\n proposed_results=proposed_results,\n extra=[],\n )\n\n return next_state, kernel_results", "response": "Takes one step of the TransitionKernel."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef bootstrap_results(self, init_state):\n with tf.compat.v1.name_scope(\n name=mcmc_util.make_name(self.name, 'mh', 'bootstrap_results'),\n values=[init_state]):\n pkr = self.inner_kernel.bootstrap_results(init_state)\n if not has_target_log_prob(pkr):\n raise ValueError(\n '\"target_log_prob\" must be a member of `inner_kernel` results.')\n x = pkr.target_log_prob\n return MetropolisHastingsKernelResults(\n accepted_results=pkr,\n is_accepted=tf.ones_like(x, dtype=tf.bool),\n log_accept_ratio=tf.zeros_like(x),\n proposed_state=init_state,\n proposed_results=pkr,\n extra=[],\n )", "response": "Returns an object with the same type as returned by one_step."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\napply the BFGS minimization 
of a differentiable function using Jorge Nocedal and Stephen Wright.", "response": "def minimize(value_and_gradients_function,\n initial_position,\n tolerance=1e-8,\n x_tolerance=0,\n f_relative_tolerance=0,\n initial_inverse_hessian_estimate=None,\n max_iterations=50,\n parallel_iterations=1,\n stopping_condition=None,\n name=None):\n \"\"\"Applies the BFGS algorithm to minimize a differentiable function.\n\n Performs unconstrained minimization of a differentiable function using the\n BFGS scheme. For details of the algorithm, see [Nocedal and Wright(2006)][1].\n\n ### Usage:\n\n The following example demonstrates the BFGS optimizer attempting to find the\n minimum for a simple two dimensional quadratic objective function.\n\n ```python\n minimum = np.array([1.0, 1.0]) # The center of the quadratic bowl.\n scales = np.array([2.0, 3.0]) # The scales along the two axes.\n\n # The objective function and the gradient.\n def quadratic(x):\n value = tf.reduce_sum(scales * (x - minimum) ** 2)\n return value, tf.gradients(value, x)[0]\n\n start = tf.constant([0.6, 0.8]) # Starting point for the search.\n optim_results = tfp.optimizer.bfgs_minimize(\n quadratic, initial_position=start, tolerance=1e-8)\n\n with tf.Session() as session:\n results = session.run(optim_results)\n # Check that the search converged\n assert(results.converged)\n # Check that the argmin is close to the actual value.\n np.testing.assert_allclose(results.position, minimum)\n # Print out the total number of function evaluations it took. Should be 6.\n print (\"Function evaluations: %d\" % results.num_objective_evaluations)\n ```\n\n ### References:\n [1]: Jorge Nocedal, Stephen Wright. Numerical Optimization. Springer Series in\n Operations Research. pp 136-140. 
2006\n http://pages.mtu.edu/~struther/Courses/OLD/Sp2013/5630/Jorge_Nocedal_Numerical_optimization_267490.pdf\n\n Args:\n value_and_gradients_function: A Python callable that accepts a point as a\n real `Tensor` and returns a tuple of `Tensor`s of real dtype containing\n the value of the function and its gradient at that point. The function\n to be minimized. The input should be of shape `[..., n]`, where `n` is\n the size of the domain of input points, and all others are batching\n dimensions. The first component of the return value should be a real\n `Tensor` of matching shape `[...]`. The second component (the gradient)\n should also be of shape `[..., n]` like the input value to the function.\n initial_position: real `Tensor` of shape `[..., n]`. The starting point, or\n points when using batching dimensions, of the search procedure. At these\n points the function value and the gradient norm should be finite.\n tolerance: Scalar `Tensor` of real dtype. Specifies the gradient tolerance\n for the procedure. If the supremum norm of the gradient vector is below\n this number, the algorithm is stopped.\n x_tolerance: Scalar `Tensor` of real dtype. If the absolute change in the\n position between one iteration and the next is smaller than this number,\n the algorithm is stopped.\n f_relative_tolerance: Scalar `Tensor` of real dtype. If the relative change\n in the objective value between one iteration and the next is smaller\n than this value, the algorithm is stopped.\n initial_inverse_hessian_estimate: Optional `Tensor` of the same dtype\n as the components of the output of the `value_and_gradients_function`.\n If specified, the shape should broadcastable to shape `[..., n, n]`; e.g.\n if a single `[n, n]` matrix is provided, it will be automatically\n broadcasted to all batches. 
Alternatively, one can also specify a\n different hessian estimate for each batch member.\n For the correctness of the algorithm, it is required that this parameter\n be symmetric and positive definite. Specifies the starting estimate for\n the inverse of the Hessian at the initial point. If not specified,\n the identity matrix is used as the starting estimate for the\n inverse Hessian.\n max_iterations: Scalar positive int32 `Tensor`. The maximum number of\n iterations for BFGS updates.\n parallel_iterations: Positive integer. The number of iterations allowed to\n run in parallel.\n stopping_condition: (Optional) A Python function that takes as input two\n Boolean tensors of shape `[...]`, and returns a Boolean scalar tensor.\n The input tensors are `converged` and `failed`, indicating the current\n status of each respective batch member; the return value states whether\n the algorithm should stop. The default is tfp.optimizer.converged_all\n which only stops when all batch members have either converged or failed.\n An alternative is tfp.optimizer.converged_any which stops as soon as one\n batch member has converged, or when all have failed.\n name: (Optional) Python str. The name prefixed to the ops created by this\n function. If not supplied, the default name 'minimize' is used.\n\n Returns:\n optimizer_results: A namedtuple containing the following items:\n converged: boolean tensor of shape `[...]` indicating for each batch\n member whether the minimum was found within tolerance.\n failed: boolean tensor of shape `[...]` indicating for each batch\n member whether a line search step failed to find a suitable step size\n satisfying Wolfe conditions. In the absence of any constraints on the\n number of objective evaluations permitted, this value will\n be the complement of `converged`. 
However, if there is\n a constraint and the search stopped due to available\n evaluations being exhausted, both `failed` and `converged`\n will be simultaneously False.\n num_objective_evaluations: The total number of objective\n evaluations performed.\n position: A tensor of shape `[..., n]` containing the last argument value\n found during the search from each starting point. If the search\n converged, then this value is the argmin of the objective function.\n objective_value: A tensor of shape `[...]` with the value of the\n objective function at the `position`. If the search converged, then\n this is the (local) minimum of the objective function.\n objective_gradient: A tensor of shape `[..., n]` containing the gradient\n of the objective function at the `position`. If the search converged\n the max-norm of this tensor should be below the tolerance.\n inverse_hessian_estimate: A tensor of shape `[..., n, n]` containing the\n inverse of the estimated Hessian.\n \"\"\"\n with tf.compat.v1.name_scope(\n name, 'minimize',\n [initial_position, tolerance, initial_inverse_hessian_estimate]):\n initial_position = tf.convert_to_tensor(\n value=initial_position, name='initial_position')\n dtype = initial_position.dtype.base_dtype\n tolerance = tf.convert_to_tensor(\n value=tolerance, dtype=dtype, name='grad_tolerance')\n f_relative_tolerance = tf.convert_to_tensor(\n value=f_relative_tolerance, dtype=dtype, name='f_relative_tolerance')\n x_tolerance = tf.convert_to_tensor(\n value=x_tolerance, dtype=dtype, name='x_tolerance')\n max_iterations = tf.convert_to_tensor(\n value=max_iterations, name='max_iterations')\n\n input_shape = distribution_util.prefer_static_shape(initial_position)\n batch_shape, domain_size = input_shape[:-1], input_shape[-1]\n\n if stopping_condition is None:\n stopping_condition = bfgs_utils.converged_all\n\n # Control inputs are an optional list of tensors to evaluate before\n # the start of the search procedure. 
These can be used to assert the\n # validity of inputs to the search procedure.\n control_inputs = None\n\n if initial_inverse_hessian_estimate is None:\n # Create a default initial inverse Hessian.\n initial_inv_hessian = tf.eye(domain_size,\n batch_shape=batch_shape,\n dtype=dtype,\n name='initial_inv_hessian')\n else:\n # If an initial inverse Hessian is supplied, compute some control inputs\n # to ensure that it is positive definite and symmetric.\n initial_inv_hessian = tf.convert_to_tensor(\n value=initial_inverse_hessian_estimate,\n dtype=dtype,\n name='initial_inv_hessian')\n control_inputs = _inv_hessian_control_inputs(initial_inv_hessian)\n hessian_shape = tf.concat([batch_shape, [domain_size, domain_size]], 0)\n initial_inv_hessian = tf.broadcast_to(initial_inv_hessian, hessian_shape)\n\n # The `state` here is a `BfgsOptimizerResults` tuple with values for the\n # current state of the algorithm computation.\n def _cond(state):\n \"\"\"Continue if iterations remain and stopping condition is not met.\"\"\"\n return ((state.num_iterations < max_iterations) &\n tf.logical_not(stopping_condition(state.converged, state.failed)))\n\n def _body(state):\n \"\"\"Main optimization loop.\"\"\"\n search_direction = _get_search_direction(state.inverse_hessian_estimate,\n state.objective_gradient)\n derivative_at_start_pt = tf.reduce_sum(\n input_tensor=state.objective_gradient * search_direction, axis=-1)\n\n # If the derivative at the start point is not negative, recompute the\n # search direction with the initial inverse Hessian.\n needs_reset = (~state.failed & ~state.converged &\n (derivative_at_start_pt >= 0))\n\n search_direction_reset = _get_search_direction(\n initial_inv_hessian, state.objective_gradient)\n\n actual_serch_direction = tf.where(\n needs_reset, search_direction_reset, search_direction)\n actual_inv_hessian = tf.where(\n needs_reset, initial_inv_hessian, state.inverse_hessian_estimate)\n\n # Replace the hessian estimate in the state, in case it 
had to be reset.\n current_state = bfgs_utils.update_fields(\n state, inverse_hessian_estimate=actual_inv_hessian)\n\n next_state = bfgs_utils.line_search_step(\n current_state,\n value_and_gradients_function, actual_serch_direction,\n tolerance, f_relative_tolerance, x_tolerance, stopping_condition)\n\n # Update the inverse Hessian if needed and continue.\n return [_update_inv_hessian(current_state, next_state)]\n\n kwargs = bfgs_utils.get_initial_state_args(\n value_and_gradients_function,\n initial_position,\n tolerance,\n control_inputs)\n kwargs['inverse_hessian_estimate'] = initial_inv_hessian\n initial_state = BfgsOptimizerResults(**kwargs)\n return tf.while_loop(\n cond=_cond,\n body=_body,\n loop_vars=[initial_state],\n parallel_iterations=parallel_iterations)[0]"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef _inv_hessian_control_inputs(inv_hessian):\n # The easiest way to validate if the inverse Hessian is positive definite is\n # to compute its Cholesky decomposition.\n is_positive_definite = tf.reduce_all(\n input_tensor=tf.math.is_finite(tf.linalg.cholesky(inv_hessian)),\n axis=[-1, -2])\n\n # Then check that the supplied inverse Hessian is symmetric.\n is_symmetric = tf.equal(bfgs_utils.norm(\n inv_hessian - _batch_transpose(inv_hessian), dims=2), 0)\n\n # Simply adding a control dependencies on these results is not enough to\n # trigger them, we need to add asserts on the results.\n return [tf.Assert(is_positive_definite,\n ['Initial inverse Hessian is not positive definite.',\n inv_hessian]),\n tf.Assert(is_symmetric,\n ['Initial inverse Hessian is not symmetric',\n inv_hessian])]", "response": "Computes control inputs to validate a provided inverse Hessian."} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\nupdate the inverse Hessian estimate of the BGFS state by computing the next inverse Hessian estimate.", "response": "def 
_update_inv_hessian(prev_state, next_state):\n \"\"\"Update the BFGS state by computing the next inverse hessian estimate.\"\"\"\n # Only update the inverse Hessian if not already failed or converged.\n should_update = ~next_state.converged & ~next_state.failed\n\n # Compute the normalization term (y^T . s), should not update if it is singular.\n gradient_delta = next_state.objective_gradient - prev_state.objective_gradient\n position_delta = next_state.position - prev_state.position\n normalization_factor = tf.reduce_sum(\n input_tensor=gradient_delta * position_delta, axis=-1)\n should_update = should_update & ~tf.equal(normalization_factor, 0)\n\n def _do_update_inv_hessian():\n next_inv_hessian = _bfgs_inv_hessian_update(\n gradient_delta, position_delta, normalization_factor,\n prev_state.inverse_hessian_estimate)\n return bfgs_utils.update_fields(\n next_state,\n inverse_hessian_estimate=tf.where(should_update,\n next_inv_hessian,\n prev_state.inverse_hessian_estimate))\n\n return prefer_static.cond(\n tf.reduce_any(input_tensor=should_update),\n _do_update_inv_hessian,\n lambda: next_state)"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _bfgs_inv_hessian_update(grad_delta, position_delta, normalization_factor,\n inv_hessian_estimate):\n \"\"\"Applies the BFGS update to the inverse Hessian estimate.\n\n The BFGS update rule is (note A^T denotes the transpose of a vector/matrix A).\n\n ```None\n rho = 1/(grad_delta^T * position_delta)\n U = (I - rho * position_delta * grad_delta^T)\n H_1 = U * H_0 * U^T + rho * position_delta * position_delta^T\n ```\n\n Here, `H_0` is the inverse Hessian estimate at the previous iteration and\n `H_1` is the next estimate. 
Note that `*` should be interpreted as the\n matrix multiplication (with the understanding that matrix multiplication for\n scalars is usual multiplication and for matrix with vector is the action of\n the matrix on the vector.).\n\n The implementation below utilizes an expanded version of the above formula\n to avoid the matrix multiplications that would be needed otherwise. By\n expansion it is easy to see that one only needs matrix-vector or\n vector-vector operations. The expanded version is:\n\n ```None\n f = 1 + rho * (grad_delta^T * H_0 * grad_delta)\n H_1 - H_0 = - rho * [position_delta * (H_0 * grad_delta)^T +\n (H_0 * grad_delta) * position_delta^T] +\n rho * f * [position_delta * position_delta^T]\n ```\n\n All the terms in square brackets are matrices and are constructed using\n vector outer products. All the other terms on the right hand side are scalars.\n Also worth noting that the first and second lines are both rank 1 updates\n applied to the current inverse Hessian estimate.\n\n Args:\n grad_delta: Real `Tensor` of shape `[..., n]`. The difference between the\n gradient at the new position and the old position.\n position_delta: Real `Tensor` of shape `[..., n]`. The change in position\n from the previous iteration to the current one.\n normalization_factor: Real `Tensor` of shape `[...]`. Should be equal to\n `grad_delta^T * position_delta`, i.e. `1/rho` as defined above.\n inv_hessian_estimate: Real `Tensor` of shape `[..., n, n]`. The previous\n estimate of the inverse Hessian. Should be positive definite and\n symmetric.\n\n Returns:\n A tuple containing the following fields\n is_valid: A Boolean `Tensor` of shape `[...]` indicating batch members\n where the update succeeded. The update can fail if the position change\n becomes orthogonal to the gradient change.\n next_inv_hessian_estimate: A `Tensor` of shape `[..., n, n]`. The next\n Hessian estimate updated using the BFGS update scheme. 
If the\n `inv_hessian_estimate` is symmetric and positive definite, the\n `next_inv_hessian_estimate` is guaranteed to satisfy the same\n conditions.\n \"\"\"\n # The quadratic form: y^T.H.y; where H is the inverse Hessian and y is the\n # gradient change.\n conditioned_grad_delta = _mul_right(inv_hessian_estimate, grad_delta)\n conditioned_grad_delta_norm = tf.reduce_sum(\n input_tensor=conditioned_grad_delta * grad_delta, axis=-1)\n\n # The first rank 1 update term requires the outer product: s.y^T.\n cross_term = _tensor_product(position_delta, conditioned_grad_delta)\n\n def _expand_scalar(s):\n # Expand dimensions of a batch of scalars to multiply or divide a matrix.\n return s[..., tf.newaxis, tf.newaxis]\n\n # Symmetrize\n cross_term += _tensor_product(conditioned_grad_delta, position_delta)\n position_term = _tensor_product(position_delta, position_delta)\n with tf.control_dependencies([position_term]):\n position_term *= _expand_scalar(\n 1 + conditioned_grad_delta_norm / normalization_factor)\n\n return (inv_hessian_estimate +\n (position_term - cross_term) / _expand_scalar(normalization_factor))", "response": "Applies the BFGS update to the inverse Hessian estimate."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\ncompute the product of a matrix with a vector on the right.", "response": "def _mul_right(mat, vec):\n \"\"\"Computes the product of a matrix with a vector on the right.\n\n Note this supports dynamic shapes and batched computation.\n\n Examples:\n\n M = tf.reshape(tf.range(6), shape=(3, 2))\n # => [[0, 1],\n # [2, 3],\n # [4, 5]]\n v = tf.constant([1, 2]) # Shape: (2,)\n _mul_right(M, v)\n # => [ 2, 8, 14] # Shape: (3,)\n\n M = tf.reshape(tf.range(30), shape=(2, 3, 5))\n # => [[[ 0, 1, 2, 3, 4],\n # [ 5, 6, 7, 8, 9],\n # [10, 11, 12, 13, 14]],\n #\n # [[15, 16, 17, 18, 19],\n # [20, 21, 22, 23, 24],\n # [25, 26, 27, 28, 29]]]\n v = tf.reshape(tf.range(10), shape=(2, 5))\n # => [[0, 1, 2, 3, 4],\n # [5, 6, 7, 8, 
9]]\n _mul_right(M, v)\n # => [[ 30, 80, 130],\n # [605, 780, 955]] # Shape: (2, 3)\n\n Args:\n mat: A `tf.Tensor` of shape `[..., n, m]`.\n vec: A `tf.Tensor` of shape `[..., m]`.\n\n Returns:\n A tensor of shape `[..., n]` with matching batch dimensions.\n \"\"\"\n return tf.squeeze(tf.matmul(mat, tf.expand_dims(vec, axis=-1)), axis=-1)"} {"SOURCE": "codesearchnet", "instruction": "Can you implement a function in Python 3 that\ncomputes the outer product of two possibly batched vectors.", "response": "def _tensor_product(t1, t2):\n \"\"\"Computes the outer product of two possibly batched vectors.\n\n Args:\n t1: A `tf.Tensor` of shape `[..., n]`.\n t2: A `tf.Tensor` of shape `[..., m]`.\n\n Returns:\n A tensor of shape `[..., n, m]` with matching batch dimensions, let's call\n it `r`, whose components are:\n\n ```None\n r[..., i, j] = t1[..., i] * t2[..., j]\n ```\n \"\"\"\n return tf.matmul(tf.expand_dims(t1, axis=-1), tf.expand_dims(t2, axis=-2))"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _batch_transpose(mat):\n n = distribution_util.prefer_static_rank(mat)\n perm = tf.range(n)\n perm = tf.concat([perm[:-2], [perm[-1], perm[-2]]], axis=0)\n return tf.transpose(a=mat, perm=perm)", "response": "Transpose a possibly batched matrix."} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\nreturns a Tensor with right-most ndims summed.", "response": "def sum_rightmost_ndims_preserving_shape(x, ndims):\n \"\"\"Return `Tensor` with right-most ndims summed.\n\n Args:\n x: the `Tensor` whose right-most `ndims` dimensions to sum\n ndims: number of right-most dimensions to sum.\n\n Returns:\n A `Tensor` resulting from calling `reduce_sum` on the `ndims` right-most\n dimensions. If the shape of `x` is statically known, the result will also\n have statically known shape. 
Otherwise, the resulting shape will only be\n known at runtime.\n \"\"\"\n x = tf.convert_to_tensor(value=x)\n if x.shape.ndims is not None:\n axes = tf.range(x.shape.ndims - ndims, x.shape.ndims)\n else:\n axes = tf.range(tf.rank(x) - ndims, tf.rank(x))\n return tf.reduce_sum(input_tensor=x, axis=axes)"} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\nreturn common dtype of items in arg_list or None.", "response": "def maybe_get_common_dtype(arg_list):\n \"\"\"Return common dtype of arg_list, or None.\n\n Args:\n arg_list: an iterable of items which are either `None` or have a `dtype`\n property.\n\n Returns:\n dtype: The common dtype of items in `arg_list`, or `None` if the list is\n empty or all items are `None`.\n \"\"\"\n # Note that `all` defaults to `True` if `arg_list` is empty.\n if all(a is None for a in arg_list):\n return None\n return dtype_util.common_dtype(arg_list, tf.float32)"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\napply an unconstrained minimization of a differentiable function using the L - BFGS algorithm.", "response": "def minimize(value_and_gradients_function,\n initial_position,\n num_correction_pairs=10,\n tolerance=1e-8,\n x_tolerance=0,\n f_relative_tolerance=0,\n initial_inverse_hessian_estimate=None,\n max_iterations=50,\n parallel_iterations=1,\n stopping_condition=None,\n name=None):\n \"\"\"Applies the L-BFGS algorithm to minimize a differentiable function.\n\n Performs unconstrained minimization of a differentiable function using the\n L-BFGS scheme. 
See [Nocedal and Wright(2006)][1] for details of the algorithm.\n\n ### Usage:\n\n The following example demonstrates the L-BFGS optimizer attempting to find the\n minimum for a simple high-dimensional quadratic objective function.\n\n ```python\n # A high-dimensional quadratic bowl.\n ndims = 60\n minimum = np.ones([ndims], dtype='float64')\n scales = np.arange(ndims, dtype='float64') + 1.0\n\n # The objective function and the gradient.\n def quadratic(x):\n value = tf.reduce_sum(scales * (x - minimum) ** 2)\n return value, tf.gradients(value, x)[0]\n\n start = np.arange(ndims, 0, -1, dtype='float64')\n optim_results = tfp.optimizer.lbfgs_minimize(\n quadratic, initial_position=start, num_correction_pairs=10,\n tolerance=1e-8)\n\n with tf.Session() as session:\n results = session.run(optim_results)\n # Check that the search converged\n assert(results.converged)\n # Check that the argmin is close to the actual value.\n np.testing.assert_allclose(results.position, minimum)\n ```\n\n ### References:\n\n [1] Jorge Nocedal, Stephen Wright. Numerical Optimization. Springer Series\n in Operations Research. pp 176-180. 2006\n\n http://pages.mtu.edu/~struther/Courses/OLD/Sp2013/5630/Jorge_Nocedal_Numerical_optimization_267490.pdf\n\n Args:\n value_and_gradients_function: A Python callable that accepts a point as a\n real `Tensor` and returns a tuple of `Tensor`s of real dtype containing\n the value of the function and its gradient at that point. The function\n to be minimized. The input is of shape `[..., n]`, where `n` is the size\n of the domain of input points, and all others are batching dimensions.\n The first component of the return value is a real `Tensor` of matching\n shape `[...]`. The second component (the gradient) is also of shape\n `[..., n]` like the input value to the function.\n initial_position: Real `Tensor` of shape `[..., n]`. The starting point, or\n points when using batching dimensions, of the search procedure. 
At these\n points the function value and the gradient norm should be finite.\n num_correction_pairs: Positive integer. Specifies the maximum number of\n (position_delta, gradient_delta) correction pairs to keep as implicit\n approximation of the Hessian matrix.\n tolerance: Scalar `Tensor` of real dtype. Specifies the gradient tolerance\n for the procedure. If the supremum norm of the gradient vector is below\n this number, the algorithm is stopped.\n x_tolerance: Scalar `Tensor` of real dtype. If the absolute change in the\n position between one iteration and the next is smaller than this number,\n the algorithm is stopped.\n f_relative_tolerance: Scalar `Tensor` of real dtype. If the relative change\n in the objective value between one iteration and the next is smaller\n than this value, the algorithm is stopped.\n initial_inverse_hessian_estimate: None. Option currently not supported.\n max_iterations: Scalar positive int32 `Tensor`. The maximum number of\n iterations for L-BFGS updates.\n parallel_iterations: Positive integer. The number of iterations allowed to\n run in parallel.\n stopping_condition: (Optional) A Python function that takes as input two\n Boolean tensors of shape `[...]`, and returns a Boolean scalar tensor.\n The input tensors are `converged` and `failed`, indicating the current\n status of each respective batch member; the return value states whether\n the algorithm should stop. The default is tfp.optimizer.converged_all\n which only stops when all batch members have either converged or failed.\n An alternative is tfp.optimizer.converged_any which stops as soon as one\n batch member has converged, or when all have failed.\n name: (Optional) Python str. The name prefixed to the ops created by this\n function. 
If not supplied, the default name 'minimize' is used.\n\n Returns:\n optimizer_results: A namedtuple containing the following items:\n converged: Scalar boolean tensor indicating whether the minimum was\n found within tolerance.\n failed: Scalar boolean tensor indicating whether a line search\n step failed to find a suitable step size satisfying Wolfe\n conditions. In the absence of any constraints on the\n number of objective evaluations permitted, this value will\n be the complement of `converged`. However, if there is\n a constraint and the search stopped due to available\n evaluations being exhausted, both `failed` and `converged`\n will be simultaneously False.\n num_objective_evaluations: The total number of objective\n evaluations performed.\n position: A tensor containing the last argument value found\n during the search. If the search converged, then\n this value is the argmin of the objective function.\n objective_value: A tensor containing the value of the objective\n function at the `position`. If the search converged, then this is\n the (local) minimum of the objective function.\n objective_gradient: A tensor containing the gradient of the objective\n function at the `position`. 
If the search converged the\n max-norm of this tensor should be below the tolerance.\n position_deltas: A tensor encoding information about the latest\n changes in `position` during the algorithm execution.\n gradient_deltas: A tensor encoding information about the latest\n changes in `objective_gradient` during the algorithm execution.\n \"\"\"\n if initial_inverse_hessian_estimate is not None:\n raise NotImplementedError(\n 'Support of initial_inverse_hessian_estimate arg not yet implemented')\n\n if stopping_condition is None:\n stopping_condition = bfgs_utils.converged_all\n\n with tf.compat.v1.name_scope(name, 'minimize', [initial_position, tolerance]):\n initial_position = tf.convert_to_tensor(\n value=initial_position, name='initial_position')\n dtype = initial_position.dtype.base_dtype\n tolerance = tf.convert_to_tensor(\n value=tolerance, dtype=dtype, name='grad_tolerance')\n f_relative_tolerance = tf.convert_to_tensor(\n value=f_relative_tolerance, dtype=dtype, name='f_relative_tolerance')\n x_tolerance = tf.convert_to_tensor(\n value=x_tolerance, dtype=dtype, name='x_tolerance')\n max_iterations = tf.convert_to_tensor(\n value=max_iterations, name='max_iterations')\n\n # The `state` here is a `LBfgsOptimizerResults` tuple with values for the\n # current state of the algorithm computation.\n def _cond(state):\n \"\"\"Continue if iterations remain and stopping condition is not met.\"\"\"\n return ((state.num_iterations < max_iterations) &\n tf.logical_not(stopping_condition(state.converged, state.failed)))\n\n def _body(current_state):\n \"\"\"Main optimization loop.\"\"\"\n search_direction = _get_search_direction(current_state)\n\n # TODO(b/120134934): Check if the derivative at the start point is not\n # negative, if so then reset position/gradient deltas and recompute\n # search direction.\n\n next_state = bfgs_utils.line_search_step(\n current_state,\n value_and_gradients_function, search_direction,\n tolerance, f_relative_tolerance, x_tolerance, 
stopping_condition)\n\n # If not failed or converged, update the Hessian estimate.\n should_update = ~(next_state.converged | next_state.failed)\n state_after_inv_hessian_update = bfgs_utils.update_fields(\n next_state,\n position_deltas=_queue_push(\n current_state.position_deltas, should_update,\n next_state.position - current_state.position),\n gradient_deltas=_queue_push(\n current_state.gradient_deltas, should_update,\n next_state.objective_gradient - current_state.objective_gradient))\n return [state_after_inv_hessian_update]\n\n initial_state = _get_initial_state(value_and_gradients_function,\n initial_position,\n num_correction_pairs,\n tolerance)\n return tf.while_loop(\n cond=_cond,\n body=_body,\n loop_vars=[initial_state],\n parallel_iterations=parallel_iterations)[0]"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\ncreating LBfgsOptimizerResults with initial state of search procedure.", "response": "def _get_initial_state(value_and_gradients_function,\n initial_position,\n num_correction_pairs,\n tolerance):\n \"\"\"Create LBfgsOptimizerResults with initial state of search procedure.\"\"\"\n init_args = bfgs_utils.get_initial_state_args(\n value_and_gradients_function,\n initial_position,\n tolerance)\n empty_queue = _make_empty_queue_for(num_correction_pairs, initial_position)\n init_args.update(position_deltas=empty_queue, gradient_deltas=empty_queue)\n return LBfgsOptimizerResults(**init_args)"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _get_search_direction(state):\n # The number of correction pairs that have been collected so far.\n num_elements = tf.minimum(\n state.num_iterations,\n distribution_util.prefer_static_shape(state.position_deltas)[0])\n\n def _two_loop_algorithm():\n \"\"\"L-BFGS two-loop algorithm.\"\"\"\n # Correction pairs are always appended to the end, so only the latest\n # `num_elements` vectors have valid position/gradient deltas.\n 
position_deltas = state.position_deltas[-num_elements:]\n gradient_deltas = state.gradient_deltas[-num_elements:]\n\n # Pre-compute all `inv_rho[i]`s.\n inv_rhos = tf.reduce_sum(\n input_tensor=gradient_deltas * position_deltas, axis=-1)\n\n def first_loop(acc, args):\n _, q_direction = acc\n position_delta, gradient_delta, inv_rho = args\n alpha = tf.reduce_sum(\n input_tensor=position_delta * q_direction, axis=-1) / inv_rho\n direction_delta = tf.expand_dims(alpha, axis=-1) * gradient_delta\n return (alpha, q_direction - direction_delta)\n\n # Run first loop body computing and collecting `alpha[i]`s, while also\n # computing the updated `q_direction` at each step.\n zero = tf.zeros_like(inv_rhos[0])\n alphas, q_directions = tf.scan(\n first_loop, [position_deltas, gradient_deltas, inv_rhos],\n initializer=(zero, state.objective_gradient), reverse=True)\n\n # We use `H^0_k = gamma_k * I` as an estimate for the initial inverse\n # hessian for the k-th iteration; then `r_direction = H^0_k * q_direction`.\n gamma_k = inv_rhos[-1] / tf.reduce_sum(\n input_tensor=gradient_deltas[-1] * gradient_deltas[-1], axis=-1)\n r_direction = tf.expand_dims(gamma_k, axis=-1) * q_directions[0]\n\n def second_loop(r_direction, args):\n alpha, position_delta, gradient_delta, inv_rho = args\n beta = tf.reduce_sum(\n input_tensor=gradient_delta * r_direction, axis=-1) / inv_rho\n direction_delta = tf.expand_dims(alpha - beta, axis=-1) * position_delta\n return r_direction + direction_delta\n\n # Finally, run second loop body computing the updated `r_direction` at each\n # step.\n r_directions = tf.scan(\n second_loop, [alphas, position_deltas, gradient_deltas, inv_rhos],\n initializer=r_direction)\n return -r_directions[-1]\n\n return prefer_static.cond(tf.equal(num_elements, 0),\n (lambda: -state.objective_gradient),\n _two_loop_algorithm)", "response": "Computes the search direction to follow at the current state."} {"SOURCE": "codesearchnet", "instruction": "Given the following 
Python 3 function, write the documentation\ndef _make_empty_queue_for(k, element):\n queue_shape = tf.concat(\n [[k], distribution_util.prefer_static_shape(element)], axis=0)\n return tf.zeros(queue_shape, dtype=element.dtype.base_dtype)", "response": "Creates a zero-filled queue suitable to hold k element-shaped tensors."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef _psd_mask(x):\n # Allegedly\n # https://scicomp.stackexchange.com/questions/12979/testing-if-a-matrix-is-positive-semi-definite\n # it is more efficient to test for positive semi-definiteness by\n # trying to compute the Cholesky decomposition -- the matrix is PSD\n # if you succeed and not PSD if you fail. However, TensorFlow's\n # Cholesky raises an exception if _any_ of the input matrices are\n # not PSD, from which I don't know how to extract _which ones_, so I\n # proceed by explicitly computing all the eigenvalues and checking\n # whether they are all positive or not.\n #\n # Also, as was discussed in the answer, it is somewhat dangerous to\n # treat SPD-ness as binary in floating-point arithmetic. 
Cholesky\n # factorization can complete and 'look' like everything is fine\n # (e.g., O(1) entries and a diagonal of all ones) but the matrix can\n # have an exponential condition number.\n eigenvalues, _ = tf.linalg.eigh(x)\n return tf.cast(\n tf.reduce_min(input_tensor=eigenvalues, axis=-1) >= 0, dtype=x.dtype)", "response": "Computes whether each square matrix in the input is positive semi-definite."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _det_large_enough_mask(x, det_bounds):\n # For the curious: I wonder whether it is possible and desirable to\n # use a Cholesky decomposition-based algorithm for this, since the\n # only matrices whose determinant this code cares about will be PSD.\n # Didn't figure out how to code that in TensorFlow.\n #\n # Expert opinion is that it would be about twice as fast since\n # Cholesky is roughly half the cost of Gaussian Elimination with\n # Partial Pivoting. But this is less of an impact than the switch in\n # _psd_mask.\n return tf.cast(tf.linalg.det(x) > det_bounds, dtype=x.dtype)", "response": "Computes whether each square matrix in the input has determinant above the given bound."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\nreturning a uniformly random tensor of correlation-like matrices.", "response": "def _uniform_correlation_like_matrix(num_rows, batch_shape, dtype, seed):\n \"\"\"Returns a uniformly random `Tensor` of \"correlation-like\" matrices.\n\n A \"correlation-like\" matrix is a symmetric square matrix with all entries\n between -1 and 1 (inclusive) and 1s on the main diagonal. 
Of these,\n the ones that are positive semi-definite are exactly the correlation\n matrices.\n\n Args:\n num_rows: Python `int` dimension of the correlation-like matrices.\n batch_shape: `Tensor` or Python `tuple` of `int` shape of the\n batch to return.\n dtype: `dtype` of the `Tensor` to return.\n seed: Random seed.\n\n Returns:\n matrices: A `Tensor` of shape `batch_shape + [num_rows, num_rows]`\n and dtype `dtype`. Each entry is in [-1, 1], and each matrix\n along the bottom two dimensions is symmetric and has 1s on the\n main diagonal.\n \"\"\"\n num_entries = num_rows * (num_rows + 1) / 2\n ones = tf.ones(shape=[num_entries], dtype=dtype)\n # It seems wasteful to generate random values for the diagonal since\n # I am going to throw them away, but `fill_triangular` fills the\n # diagonal, so I probably need them.\n # It's not impossible that it would be more efficient to just fill\n # the whole matrix with random values instead of messing with\n # `fill_triangular`. Then would need to filter almost half out with\n # `matrix_band_part`.\n unifs = uniform.Uniform(-ones, ones).sample(batch_shape, seed=seed)\n tril = util.fill_triangular(unifs)\n symmetric = tril + tf.linalg.matrix_transpose(tril)\n diagonal_ones = tf.ones(\n shape=util.pad(batch_shape, axis=0, back=True, value=num_rows),\n dtype=dtype)\n return tf.linalg.set_diag(symmetric, diagonal_ones)"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef correlation_matrix_volume_rejection_samples(\n det_bounds, dim, sample_shape, dtype, seed):\n \"\"\"Returns rejection samples from trying to get good correlation matrices.\n\n The proposal being rejected from is the uniform distribution on\n \"correlation-like\" matrices. We say a matrix is \"correlation-like\"\n if it is a symmetric square matrix with all entries between -1 and 1\n (inclusive) and 1s on the main diagonal. 
Of these, the ones that\n are positive semi-definite are exactly the correlation matrices.\n\n The rejection algorithm, then, is to sample a `Tensor` of\n `sample_shape` correlation-like matrices of dimensions `dim` by\n `dim`, and check each one for (i) being a correlation matrix (i.e.,\n PSD), and (ii) having determinant at least the corresponding entry\n of `det_bounds`.\n\n Args:\n det_bounds: A `Tensor` of lower bounds on the determinants of\n acceptable matrices. The shape must broadcast with `sample_shape`.\n dim: A Python `int` dimension of correlation matrices to sample.\n sample_shape: Python `tuple` of `int` shape of the samples to\n compute, excluding the two matrix dimensions.\n dtype: The `dtype` in which to do the computation.\n seed: Random seed.\n\n Returns:\n weights: A `Tensor` of shape `sample_shape`. Each entry is 0 if the\n corresponding matrix was not a correlation matrix, or had too\n small of a determinant. Otherwise, the entry is the\n multiplicative inverse of the density of proposing that matrix\n uniformly, i.e., the volume of the set of `dim` by `dim`\n correlation-like matrices.\n volume: The volume of the set of `dim` by `dim` correlation-like\n matrices.\n \"\"\"\n with tf.compat.v1.name_scope(\"rejection_sampler\"):\n rej_proposals = _uniform_correlation_like_matrix(\n dim, sample_shape, dtype, seed=seed)\n rej_proposal_volume = 2. 
** (dim * (dim - 1) / 2.)\n    # The density of proposing any given point is 1 / rej_proposal_volume;\n    # The weight of that point should be scaled by\n    # 1 / density = rej_proposal_volume.\n    rej_weights = rej_proposal_volume * _psd_mask(\n        rej_proposals) * _det_large_enough_mask(rej_proposals, det_bounds)\n    return rej_weights, rej_proposal_volume", "response": "Returns rejection-sampling weights (and the proposal volume) for drawing correlation matrices with determinants above the given bounds."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef _clopper_pearson_confidence_interval(samples, error_rate):\n  # TODO(b/78025336) Migrate this confidence interval function\n  # to statistical_testing.py. In order to do that\n  # - Get the binomial CDF from the Binomial distribution\n  # - Implement scalar root finding in TF. Batch bisection search\n  #   shouldn't be too hard, and is definitely good enough for this\n  #   problem. Batching the Brent algorithm (from scipy) that is used\n  #   here may be more involved, but may also not be necessary---it's\n  #   only used here because scipy made it convenient. 
In particular,\n # robustness is more important than speed here, which may make\n # bisection search actively better.\n # - The rest is just a matter of rewriting in the appropriate style.\n if optimize is None or stats is None:\n raise ValueError(\n \"Scipy is required for computing Clopper-Pearson confidence intervals\")\n if len(samples.shape) != 1:\n raise ValueError(\"Batch semantics not implemented\")\n n = len(samples)\n low = np.amin(samples)\n high = np.amax(samples)\n successes = np.count_nonzero(samples - low)\n failures = np.count_nonzero(samples - high)\n if successes + failures != n:\n uniques = np.unique(samples)\n msg = (\"Purportedly Bernoulli distribution had distinct samples\"\n \" {}, {}, and {}\".format(uniques[0], uniques[1], uniques[2]))\n raise ValueError(msg)\n def p_small_enough(p):\n prob = stats.binom.logcdf(successes, n, p)\n return prob - np.log(error_rate / 2.)\n def p_big_enough(p):\n prob = stats.binom.logsf(successes, n, p)\n return prob - np.log(error_rate / 2.)\n high_p = optimize.brentq(\n p_small_enough, float(successes) / n, 1., rtol=1e-9)\n low_p = optimize.brentq(\n p_big_enough, 0., float(successes) / n, rtol=1e-9)\n low_interval = low + (high - low) * low_p\n high_interval = low + (high - low) * high_p\n return (low_interval, high_interval)", "response": "This function calculates the confidence interval for the given samples."} {"SOURCE": "codesearchnet", "instruction": "Can you create a Python 3 function that\ncomputes the confidence intervals for the true volumes of the given determinant bounds.", "response": "def compute_true_volumes(\n det_bounds, dim, num_samples, error_rate=1e-6, seed=42):\n \"\"\"Returns confidence intervals for the desired correlation matrix volumes.\n\n The confidence intervals are computed by the [Clopper-Pearson method]\n (https://en.wikipedia.org/wiki/Binomial_proportion_confidence_interval).\n\n Args:\n det_bounds: A rank-1 numpy array of lower bounds on the\n determinants of acceptable 
matrices. Entries must be unique.\n dim: A Python `int` dimension of correlation matrices to sample.\n num_samples: The number of samples to draw.\n error_rate: The statistical significance of the returned\n confidence intervals. The significance is broadcast: Each\n returned interval separately may be incorrect with probability\n (under the sample of correlation-like matrices drawn internally)\n at most `error_rate`.\n seed: Random seed.\n\n Returns:\n bounds: A Python `dict` mapping each determinant bound to the low, high\n tuple giving the confidence interval.\n \"\"\"\n bounds = {}\n with tf.compat.v1.Session() as sess:\n rej_weights, _ = correlation_matrix_volume_rejection_samples(\n det_bounds, dim, [num_samples, len(det_bounds)], np.float32, seed=seed)\n rej_weights = sess.run(rej_weights)\n for rw, det in zip(np.rollaxis(rej_weights, 1), det_bounds):\n template = (\"Estimating volume of {}x{} correlation \"\n \"matrices with determinant >= {}.\")\n print(template.format(dim, dim, det))\n sys.stdout.flush()\n bounds[det] = _clopper_pearson_confidence_interval(\n rw, error_rate=error_rate)\n return bounds"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\ncompute the cumulative density function (CDF) of von Mises distribution. Denote the density of vonMises(loc=0, concentration=concentration) by p(t). Note that p(t) is periodic, p(t) = p(t + 2 pi). The CDF at the point x is defined as int_{-pi}^x p(t) dt. Thus, when x in [-pi, pi], the CDF is in [0, 1]; when x is in [pi, 3pi], the CDF is in [1, 2], etc. The CDF is not available in closed form. Instead, we use the method [1] which uses either a series expansion or a Normal approximation, depending on the value of concentration. We also compute the derivative of the CDF w.r.t. both x and concentration. The derivative w.r.t. x is p(x), while the derivative w.r.t. concentration is computed using automatic differentiation. 
We use forward mode for the series case (which allows to save memory) and backward mode for the Normal approximation. Arguments: x: The point at which to evaluate the CDF. concentration: The concentration parameter of the von Mises distribution. Returns: The value of the CDF computed elementwise. References: [1] G. Hill \"Algorithm 518: Incomplete Bessel Function I_0. The Von Mises Distribution.\" ACM Transactions on Mathematical Software, 1977", "response": "def von_mises_cdf(x, concentration):\n \"\"\"Computes the cumulative density function (CDF) of von Mises distribution.\n\n Denote the density of vonMises(loc=0, concentration=concentration) by p(t).\n Note that p(t) is periodic, p(t) = p(t + 2 pi).\n The CDF at the point x is defined as int_{-pi}^x p(t) dt.\n Thus, when x in [-pi, pi], the CDF is in [0, 1]; when x is in [pi, 3pi], the\n CDF is in [1, 2], etc.\n\n The CDF is not available in closed form. Instead, we use the method [1]\n which uses either a series expansion or a Normal approximation, depending on\n the value of concentration.\n\n We also compute the derivative of the CDF w.r.t. both x and concentration.\n The derivative w.r.t. x is p(x), while the derivative w.r.t. concentration is\n computed\n using automatic differentiation. We use forward mode for the series case\n (which allows to save memory) and backward mode for the Normal approximation.\n\n Arguments:\n x: The point at which to evaluate the CDF.\n concentration: The concentration parameter of the von Mises distribution.\n\n Returns:\n The value of the CDF computed elementwise.\n\n References:\n [1] G. Hill \"Algorithm 518: Incomplete Bessel Function I_0. The Von Mises\n Distribution.\" ACM Transactions on Mathematical Software, 1977\n \"\"\"\n x = tf.convert_to_tensor(value=x)\n concentration = tf.convert_to_tensor(value=concentration)\n dtype = x.dtype\n\n # Map x to [-pi, pi].\n num_periods = tf.round(x / (2. * np.pi))\n x -= (2. 
* np.pi) * num_periods\n\n # We take the hyperparameters from Table I of [1], the row for D=8\n # decimal digits of accuracy. ck is the cut-off for concentration:\n # if concentration < ck, the series expansion is used;\n # otherwise, the Normal approximation is used.\n ck = 10.5\n # The number of terms in the series expansion. [1] chooses it as a function\n # of concentration, n(concentration). This is hard to implement in TF.\n # Instead, we upper bound it over concentrations:\n # num_terms = ceil ( max_{concentration <= ck} n(concentration) ).\n # The maximum is achieved for concentration = ck.\n num_terms = 20\n\n cdf_series, dcdf_dconcentration_series = _von_mises_cdf_series(\n x, concentration, num_terms, dtype)\n cdf_normal, dcdf_dconcentration_normal = _von_mises_cdf_normal(\n x, concentration, dtype)\n\n use_series = concentration < ck\n cdf = tf.where(use_series, cdf_series, cdf_normal)\n cdf += num_periods\n dcdf_dconcentration = tf.where(use_series, dcdf_dconcentration_series,\n dcdf_dconcentration_normal)\n\n def grad(dy):\n prob = tf.exp(concentration * (tf.cos(x) - 1.)) / (\n (2. * np.pi) * tf.math.bessel_i0e(concentration))\n return dy * prob, dy * dcdf_dconcentration\n\n return cdf, grad"} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\ncompute the von Mises CDF and its derivative via series expansion.", "response": "def _von_mises_cdf_series(x, concentration, num_terms, dtype):\n \"\"\"Computes the von Mises CDF and its derivative via series expansion.\"\"\"\n # Keep the number of terms as a float. It should be a small integer, so\n # exactly representable as a float.\n num_terms = tf.cast(num_terms, dtype=dtype)\n\n def loop_body(n, rn, drn_dconcentration, vn, dvn_dconcentration):\n \"\"\"One iteration of the series loop.\"\"\"\n\n denominator = 2. * n / concentration + rn\n ddenominator_dk = -2. * n / concentration ** 2 + drn_dconcentration\n rn = 1. 
/ denominator\n drn_dconcentration = -ddenominator_dk / denominator ** 2\n\n multiplier = tf.sin(n * x) / n + vn\n vn = rn * multiplier\n dvn_dconcentration = (drn_dconcentration * multiplier +\n rn * dvn_dconcentration)\n n -= 1.\n\n return n, rn, drn_dconcentration, vn, dvn_dconcentration\n\n (_, _, _, vn, dvn_dconcentration) = tf.while_loop(\n cond=lambda n, *_: n > 0.,\n body=loop_body,\n loop_vars=(\n num_terms, # n\n tf.zeros_like(x, name=\"rn\"),\n tf.zeros_like(x, name=\"drn_dconcentration\"),\n tf.zeros_like(x, name=\"vn\"),\n tf.zeros_like(x, name=\"dvn_dconcentration\"),\n ),\n )\n\n cdf = .5 + x / (2. * np.pi) + vn / np.pi\n dcdf_dconcentration = dvn_dconcentration / np.pi\n\n # Clip the result to [0, 1].\n cdf_clipped = tf.clip_by_value(cdf, 0., 1.)\n # The clipped values do not depend on concentration anymore, so set their\n # derivative to zero.\n dcdf_dconcentration *= tf.cast((cdf >= 0.) & (cdf <= 1.), dtype)\n\n return cdf_clipped, dcdf_dconcentration"} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\ncomputes the von Mises CDF and its derivative via Normal approximation.", "response": "def _von_mises_cdf_normal(x, concentration, dtype):\n \"\"\"Computes the von Mises CDF and its derivative via Normal approximation.\"\"\"\n\n def cdf_func(concentration):\n \"\"\"A helper function that is passed to value_and_gradient.\"\"\"\n # z is an \"almost Normally distributed\" random variable.\n z = ((np.sqrt(2. / np.pi) / tf.math.bessel_i0e(concentration)) *\n tf.sin(.5 * x))\n\n # This is the correction described in [1] which reduces the error\n # of the Normal approximation.\n z2 = z ** 2\n z3 = z2 * z\n z4 = z2 ** 2\n c = 24. * concentration\n c1 = 56.\n\n xi = z - z3 / ((c - 2. * z2 - 16.) / 3. -\n (z4 + (7. / 4.) * z2 + 167. / 2.) 
/ (c - c1 - z2 + 3.)) ** 2\n\n distrib = normal.Normal(tf.cast(0., dtype), tf.cast(1., dtype))\n\n return distrib.cdf(xi)\n\n return value_and_gradient(cdf_func, concentration)"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef random_von_mises(shape, concentration, dtype=tf.float32, seed=None):\n seed = SeedStream(seed, salt=\"von_mises\")\n concentration = tf.convert_to_tensor(\n value=concentration, dtype=dtype, name=\"concentration\")\n\n @tf.custom_gradient\n def rejection_sample_with_gradient(concentration):\n \"\"\"Performs rejection sampling for standardized von Mises.\n\n A nested function is required because @tf.custom_gradient does not handle\n non-tensor inputs such as dtype. Instead, they are captured by the outer\n scope.\n\n Arguments:\n concentration: The concentration parameter of the distribution.\n\n Returns:\n Differentiable samples of standardized von Mises.\n \"\"\"\n r = 1. + tf.sqrt(1. + 4. * concentration ** 2)\n rho = (r - tf.sqrt(2. * r)) / (2. * concentration)\n\n s_exact = (1. + rho ** 2) / (2. * rho)\n\n # For low concentration, s becomes numerically unstable.\n # To fix that, we use an approximation. Here is the derivation.\n # First-order Taylor expansion at conc = 0 gives\n # sqrt(1 + 4 concentration^2) ~= 1 + (2 concentration)^2 / 2.\n # Therefore, r ~= 2 + 2 concentration. By plugging this into rho, we have\n # rho ~= conc + 1 / conc - sqrt(1 + 1 / concentration^2).\n # Let's expand the last term at concentration=0 up to the linear term:\n # sqrt(1 + 1 / concentration^2) ~= 1 / concentration + concentration / 2\n # Thus, rho ~= concentration / 2. Finally,\n # s = 1 / (2 rho) + rho / 2 ~= 1 / concentration + concentration / 4.\n # Since concentration is small, we drop the second term and simply use\n # s ~= 1 / concentration.\n s_approximate = 1. 
/ concentration\n\n # To compute the cutoff, we compute s_exact using mpmath with 30 decimal\n # digits precision and compare that to the s_exact and s_approximate\n # computed with dtype. Then, the cutoff is the largest concentration for\n # which abs(s_exact - s_exact_mpmath) > abs(s_approximate - s_exact_mpmath).\n s_concentration_cutoff_dict = {\n tf.float16: 1.8e-1,\n tf.float32: 2e-2,\n tf.float64: 1.2e-4,\n }\n s_concentration_cutoff = s_concentration_cutoff_dict[dtype]\n\n s = tf.where(concentration > s_concentration_cutoff, s_exact, s_approximate)\n\n def loop_body(done, u, w):\n \"\"\"Resample the non-accepted points.\"\"\"\n # We resample u each time completely. Only its sign is used outside the\n # loop, which is random.\n u = tf.random.uniform(\n shape, minval=-1., maxval=1., dtype=dtype, seed=seed())\n z = tf.cos(np.pi * u)\n # Update the non-accepted points.\n w = tf.where(done, w, (1. + s * z) / (s + z))\n y = concentration * (s - w)\n\n v = tf.random.uniform(\n shape, minval=0., maxval=1., dtype=dtype, seed=seed())\n accept = (y * (2. - y) >= v) | (tf.math.log(y / v) + 1. >= y)\n\n return done | accept, u, w\n\n _, u, w = tf.while_loop(\n cond=lambda done, *_: ~tf.reduce_all(input_tensor=done),\n body=loop_body,\n loop_vars=(\n tf.zeros(shape, dtype=tf.bool, name=\"done\"),\n tf.zeros(shape, dtype=dtype, name=\"u\"),\n tf.zeros(shape, dtype=dtype, name=\"w\"),\n ),\n # The expected number of iterations depends on concentration.\n # It monotonically increases from one iteration for concentration = 0 to\n # sqrt(2 pi / e) ~= 1.52 iterations for concentration = +inf [1].\n # We use a limit of 100 iterations to avoid infinite loops\n # for very large / nan concentration.\n maximum_iterations=100,\n parallel_iterations=1 if seed.original_seed is None else 10,\n )\n\n x = tf.sign(u) * tf.math.acos(w)\n\n def grad(dy):\n \"\"\"The gradient of the von Mises samples w.r.t. 
concentration.\"\"\"\n broadcast_concentration = concentration + tf.zeros_like(x)\n _, dcdf_dconcentration = value_and_gradient(\n lambda conc: von_mises_cdf(x, conc), broadcast_concentration)\n inv_prob = tf.exp(-broadcast_concentration * (tf.cos(x) - 1.)) * (\n (2. * np.pi) * tf.math.bessel_i0e(broadcast_concentration))\n # Compute the implicit reparameterization gradient [2],\n # dz/dconc = -(dF(z; conc) / dconc) / p(z; conc)\n ret = dy * (-inv_prob * dcdf_dconcentration)\n # Sum over the sample dimensions. Assume that they are always the first\n # ones.\n num_sample_dimensions = (tf.rank(broadcast_concentration) -\n tf.rank(concentration))\n return tf.reduce_sum(\n input_tensor=ret, axis=tf.range(num_sample_dimensions))\n\n return x, grad\n\n return rejection_sample_with_gradient(concentration)", "response": "Returns a random tensor from the standardized von Mises distribution."} {"SOURCE": "codesearchnet", "instruction": "How would you explain what the following Python 3 function does\ndef one_step(\n objective_function,\n population,\n population_values=None,\n differential_weight=0.5,\n crossover_prob=0.9,\n seed=None,\n name=None):\n \"\"\"Performs one step of the differential evolution algorithm.\n\n Args:\n objective_function: A Python callable that accepts a batch of possible\n solutions and returns the values of the objective function at those\n arguments as a rank 1 real `Tensor`. This specifies the function to be\n minimized. The input to this callable may be either a single `Tensor`\n or a Python `list` of `Tensor`s. The signature must match the format of\n the argument `population`. (i.e. objective_function(*population) must\n return the value of the function to be minimized).\n population: `Tensor` or Python `list` of `Tensor`s representing the\n current population vectors. Each `Tensor` must be of the same real dtype.\n The first dimension indexes individual population members while the\n rest of the dimensions are consumed by the value function. 
For example,\n if the population is a single `Tensor` of shape [n, m1, m2], then `n` is\n the population size and the output of `objective_function` applied to the\n population is a `Tensor` of shape [n]. If the population is a python\n list of `Tensor`s then each `Tensor` in the list should have the first\n axis of a common size, say `n` and `objective_function(*population)`\n should return a `Tensor of shape [n]. The population must have at least\n 4 members for the algorithm to work correctly.\n population_values: A `Tensor` of rank 1 and real dtype. The result of\n applying `objective_function` to the `population`. If not supplied it is\n computed using the `objective_function`.\n Default value: None.\n differential_weight: Real scalar `Tensor`. Must be positive and less than\n 2.0. The parameter controlling the strength of mutation.\n Default value: 0.5\n crossover_prob: Real scalar `Tensor`. Must be between 0 and 1. The\n probability of recombination per site.\n Default value: 0.9\n seed: `int` or None. The random seed for this `Op`. If `None`, no seed is\n applied.\n Default value: None.\n name: (Optional) Python str. The name prefixed to the ops created by this\n function. If not supplied, the default name 'one_step' is\n used.\n Default value: None\n\n Returns:\n A sequence containing the following elements (in order):\n next_population: A `Tensor` or Python `list` of `Tensor`s of the same\n structure as the input population. The population at the next generation.\n next_population_values: A `Tensor` of same shape and dtype as input\n `population_values`. 
The function values for the `next_population`.\n \"\"\"\n with tf.compat.v1.name_scope(\n name, 'one_step',\n [population, population_values, differential_weight, crossover_prob]):\n population, _ = _ensure_list(population)\n if population_values is None:\n population_values = objective_function(*population)\n population_size = tf.shape(input=population[0])[0]\n seed_stream = distributions.SeedStream(seed, salt='one_step')\n mixing_indices = _get_mixing_indices(population_size, seed=seed_stream())\n # Construct the mutated solution vectors. There is one for each member of\n # the population.\n mutants = _get_mutants(population,\n population_size,\n mixing_indices,\n differential_weight)\n # Perform recombination between the parents and the mutants.\n candidates = _binary_crossover(population,\n population_size,\n mutants,\n crossover_prob,\n seed=seed_stream())\n candidate_values = objective_function(*candidates)\n if population_values is None:\n population_values = objective_function(*population)\n\n infinity = tf.zeros_like(population_values) + np.inf\n\n population_values = tf.where(\n tf.math.is_nan(population_values), x=infinity, y=population_values)\n\n to_replace = candidate_values < population_values\n next_population = [\n tf.where(to_replace, x=candidates_part, y=population_part)\n for candidates_part, population_part in zip(candidates, population)\n ]\n next_values = tf.where(to_replace, x=candidate_values, y=population_values)\n\n return next_population, next_values", "response": "A function that performs one step of differential evolution algorithm."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef minimize(objective_function,\n initial_population=None,\n initial_position=None,\n population_size=50,\n population_stddev=1.,\n max_iterations=100,\n func_tolerance=0,\n position_tolerance=1e-8,\n differential_weight=0.5,\n crossover_prob=0.9,\n seed=None,\n name=None):\n \"\"\"Applies 
the Differential evolution algorithm to minimize a function.\n\n Differential Evolution is an evolutionary optimization algorithm which works\n on a set of candidate solutions called the population. It iteratively\n improves the population by applying genetic operators of mutation and\n recombination. The objective function `f` supplies the fitness of each\n candidate. A candidate `s_1` is considered better than `s_2` if\n `f(s_1) < f(s_2)`.\n\n This method allows the user to either specify an initial population or a\n single candidate solution. If a single solution is specified, a population\n of the specified size is initialized by adding independent normal noise\n to the candidate solution.\n\n The implementation also supports a multi-part specification of the state. For\n example, consider the objective function:\n\n ```python\n # x is a tensor of shape [n, m] while y is of shape [n].\n def objective(x, y):\n return tf.math.reduce_sum(x ** 2, axis=-1) + y ** 2\n ```\n The state in this case is specified by two input tensors `x` and `y`. To\n apply the algorithm to this objective function, one would need to specify\n either an initial population as a list of two tensors of shapes\n `[population_size, k]` and `[population_size]`. 
The following code shows the\n complete example:\n\n ```python\n population_size = 40\n # With an initial population and a multi-part state.\n initial_population = (tf.random.normal([population_size]),\n tf.random.normal([population_size]))\n def easom_fn(x, y):\n return -(tf.math.cos(x) * tf.math.cos(y) *\n tf.math.exp(-(x-np.pi)**2 - (y-np.pi)**2))\n\n optim_results = tfp.optimizers.differential_evolution_minimize(\n easom_fn,\n initial_population=initial_population,\n seed=43210)\n\n print (optim_results.converged)\n print (optim_results.position) # Should be (close to) [pi, pi].\n print (optim_results.objective_value) # Should be -1.\n\n\n # With a single starting point\n initial_position = (tf.constant(1.0), tf.constant(1.0))\n\n optim_results = tfp.optimizers.differential_evolution_minimize(\n easom_fn,\n initial_position=initial_position,\n population_size=40,\n population_stddev=2.0,\n seed=43210)\n ```\n\n Args:\n objective_function: A Python callable that accepts a batch of possible\n solutions and returns the values of the objective function at those\n arguments as a rank 1 real `Tensor`. This specifies the function to be\n minimized. The input to this callable may be either a single `Tensor`\n or a Python `list` of `Tensor`s. The signature must match the format of\n the argument `population`. (i.e. objective_function(*population) must\n return the value of the function to be minimized).\n initial_population: A real `Tensor` or Python list of `Tensor`s.\n If a list, each `Tensor` must be of rank at least 1 and with a common\n first dimension. The first dimension indexes into the candidate solutions\n while the rest of the dimensions (if any) index into an individual\n solution. The size of the population must be at least 4. This is a\n requirement of the DE algorithm.\n initial_position: A real `Tensor` of any shape. The seed solution used\n to initialize the population of solutions. 
If this parameter is specified\n then `initial_population` must not be specified.\n population_size: A positive scalar int32 `Tensor` greater than 4. The\n size of the population to evolve. This parameter is ignored if\n `initial_population` is specified.\n Default value: 50.\n population_stddev: A positive scalar real `Tensor` of the same dtype\n as `initial_position`. This parameter is ignored if `initial_population`\n is specified. Used to generate the population from the `initial_position`\n by adding random normal noise with zero mean and the specified standard\n deviation.\n Default value: 1.0\n max_iterations: Positive scalar int32 `Tensor`. The maximum number of\n generations to evolve the population for.\n Default value: 100\n func_tolerance: Scalar `Tensor` of the same dtype as the output of the\n `objective_function`. The algorithm stops if the absolute difference\n between the largest and the smallest objective function value in the\n population is below this number.\n Default value: 0\n position_tolerance: Scalar `Tensor` of the same real dtype as\n `initial_position` or `initial_population`. The algorithm terminates if\n the largest absolute difference between the coordinates of the population\n members is below this threshold.\n Default value: 1e-8\n differential_weight: Real scalar `Tensor`. Must be positive and less than\n 2.0. The parameter controlling the strength of mutation in the algorithm.\n Default value: 0.5\n crossover_prob: Real scalar `Tensor`. Must be between 0 and 1. The\n probability of recombination per site.\n Default value: 0.9\n seed: `int` or None. The random seed for this `Op`. If `None`, no seed is\n applied.\n Default value: None.\n name: (Optional) Python str. The name prefixed to the ops created by this\n function. 
If not supplied, the default name\n 'differential_evolution_minimize' is used.\n Default value: None\n\n Returns:\n optimizer_results: An object containing the following attributes:\n converged: Scalar boolean `Tensor` indicating whether the minimum was\n found within the specified tolerances.\n num_objective_evaluations: The total number of objective\n evaluations performed.\n position: A `Tensor` containing the best point found during the search.\n If the search converged, then this value is the argmin of the\n objective function within the specified tolerances.\n objective_value: A `Tensor` containing the value of the objective\n function at the `position`. If the search\n converged, then this is the (local) minimum of\n the objective function.\n final_population: The final state of the population.\n final_objective_values: The objective function evaluated at the\n final population.\n initial_population: The starting population.\n initial_objective_values: The objective function evaluated at the\n initial population.\n num_iterations: The number of iterations of the main algorithm body.\n\n Raises:\n ValueError: If neither the initial population, nor the initial position\n are specified or if both are specified.\n \"\"\"\n\n if initial_population is None and initial_position is None:\n raise ValueError('Either the initial population or the initial position '\n 'must be specified.')\n if initial_population is not None and initial_position is not None:\n raise ValueError('Only one of initial population or initial position '\n 'should be specified')\n\n with tf.compat.v1.name_scope(\n name,\n default_name='minimize',\n values=[\n initial_population, initial_position, population_size,\n population_stddev, max_iterations, func_tolerance, position_tolerance,\n differential_weight, crossover_prob\n ]):\n (\n was_iterable,\n population,\n population_values,\n max_iterations,\n func_tolerance,\n position_tolerance,\n differential_weight,\n crossover_prob\n ) = 
_get_initial_args(objective_function,\n initial_population,\n initial_position,\n population_size,\n population_stddev,\n max_iterations,\n func_tolerance,\n position_tolerance,\n differential_weight,\n crossover_prob,\n seed)\n\n def evolve_body(loop_vars):\n \"\"\"Performs one step of the evolution.\"\"\"\n next_population, next_population_values = one_step(\n objective_function,\n loop_vars.population,\n population_values=loop_vars.population_values,\n differential_weight=differential_weight,\n crossover_prob=crossover_prob,\n seed=seed)\n converged = _check_convergence(next_population,\n next_population_values,\n func_tolerance,\n position_tolerance)\n\n failed = _check_failure(next_population_values)\n\n return [_MinimizeLoopVars(\n converged=converged,\n failed=failed,\n num_iterations=loop_vars.num_iterations+1,\n population=next_population,\n population_values=next_population_values)]\n\n def evolve_cond(loop_vars):\n should_stop = (\n loop_vars.failed |\n loop_vars.converged |\n (max_iterations is not None and\n loop_vars.num_iterations >= max_iterations))\n return ~should_stop\n\n initial_vars = _MinimizeLoopVars(\n converged=tf.convert_to_tensor(value=False),\n failed=tf.convert_to_tensor(value=False),\n num_iterations=tf.convert_to_tensor(value=0),\n population=population,\n population_values=population_values)\n final_state = tf.while_loop(\n cond=evolve_cond, body=evolve_body, loop_vars=(initial_vars,))[0]\n best_position, best_values = _find_best_in_population(\n final_state.population,\n final_state.population_values)\n # Ensure we return a similar structure to what the user supplied.\n final_population = final_state.population\n if not was_iterable:\n final_population = final_population[0]\n best_position = best_position[0]\n return DifferentialEvolutionOptimizerResults(\n converged=final_state.converged,\n failed=final_state.failed,\n position=best_position,\n objective_value=best_values,\n final_population=final_population,\n 
final_objective_values=final_state.population_values,\n        initial_population=population,\n        initial_objective_values=population_values,\n        num_iterations=final_state.num_iterations)", "response": "Applies the differential evolution algorithm to minimize the given objective function."} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\nchecks if all the population values are NaN or infinite.", "response": "def _check_failure(population_values):\n  \"\"\"Checks if all the population values are NaN/infinite.\"\"\"\n  return tf.math.reduce_all(input_tensor=tf.math.is_inf(population_values))"} {"SOURCE": "codesearchnet", "instruction": "Can you create a Python 3 function that\nfinds the population member with the lowest value.", "response": "def _find_best_in_population(population, values):\n  \"\"\"Finds the population member with the lowest value.\"\"\"\n  best_value = tf.math.reduce_min(input_tensor=values)\n  best_index = tf.where(tf.math.equal(values, best_value))[0, 0]\n\n  return ([population_part[best_index] for population_part in population],\n          best_value)"} {"SOURCE": "codesearchnet", "instruction": "Can you generate a brief explanation for the following Python 3 code\ndef _check_convergence(population,\n                       population_values,\n                       func_tolerance,\n                       position_tolerance):\n  \"\"\"Checks whether the convergence criteria have been met.\"\"\"\n  # Check func tolerance\n  value_range = tf.math.abs(\n      tf.math.reduce_max(input_tensor=population_values) -\n      tf.math.reduce_min(input_tensor=population_values))\n  value_converged = value_range <= func_tolerance\n  # Ideally, we would compute the position convergence by computing the\n  # pairwise distance between every member of the population and checking if\n  # the maximum of those is less than the supplied tolerance. However, this is\n  # completely infeasible in terms of performance. We adopt a more conservative\n  # approach which checks the distance between the first population member\n  # with the rest of the population. 
If the largest such distance is less than\n # half the supplied tolerance, we stop. The reason why this is sufficient is\n # as follows. For any pair of distinct points (a, b) in the population, we\n # have the relation: |a - b| <= |x0 - a| + |x0 - b|, where x0 is any\n # other point. In particular, let x0 be the first element of the population\n # and suppose that the largest distance between this point and any other\n # member is epsilon. Then, for any pair of points (a, b),\n # |a - b| <= 2 * epsilon and hence, the maximum distance between any pair of\n # points in the population is bounded above by twice the distance between\n # the first point and other points.\n half_tol = position_tolerance / 2\n def part_converged(part):\n return tf.math.reduce_max(input_tensor=tf.math.abs(part -\n part[0])) <= half_tol\n\n x_converged = tf.math.reduce_all(\n input_tensor=[part_converged(part) for part in population])\n return value_converged | x_converged", "response": "Checks whether the convergence criteria have been met."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _get_starting_population(initial_population,\n initial_position,\n population_size,\n population_stddev,\n seed):\n \"\"\"Constructs the initial population.\n\n If an initial population is not already provided, this function constructs\n a population by adding random normal noise to the initial position.\n\n Args:\n initial_population: None or a list of `Tensor`s. The initial population.\n initial_position: None or a list of `Tensor`s. The initial position.\n If initial_population is None, this argument must not be None.\n population_size: Scalar integer `Tensor`. The number of members in the\n population. 
If the initial population is not None, this parameter is\n ignored.\n population_stddev: A positive scalar real `Tensor` of the same dtype\n as `initial_position` or `initial_population` (whichever is not None).\n This parameter is ignored if `initial_population`\n is specified. Used to generate the population from the\n `initial_position` by adding random normal noise with zero mean and\n the specified standard deviation.\n seed: Seed for random number generation.\n\n Returns:\n A list of `Tensor`s. The initial population.\n \"\"\"\n if initial_population is not None:\n return [tf.convert_to_tensor(value=part) for part in initial_population]\n # Constructs the population by adding normal noise to the initial position.\n seed_stream = distributions.SeedStream(seed, salt='get_starting_population')\n population = []\n for part in initial_position:\n part = tf.convert_to_tensor(value=part)\n part_event_shape = tf.shape(input=part)\n # We only draw population_size-1 random vectors because we want to ensure\n # that the supplied position is part of the population. 
The first member\n    # is set to be the initial_position.\n    population_part_shape = tf.concat([[population_size-1],\n                                       part_event_shape], axis=0)\n    population_part = tf.random.normal(population_part_shape,\n                                       stddev=population_stddev,\n                                       dtype=part.dtype.base_dtype,\n                                       seed=seed_stream())\n    population_part += part\n    population_part = tf.concat([[part], population_part], axis=0)\n    population.append(population_part)\n  return population", "response": "Constructs the starting population."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function for\nperforming binary crossover for the current population.", "response": "def _binary_crossover(population,\n                      population_size,\n                      mutants,\n                      crossover_prob,\n                      seed):\n  \"\"\"Performs recombination by binary crossover for the current population.\n\n  Let v_i denote the i'th component of the member v and m_i the corresponding\n  component of the mutant vector corresponding to v. Then the crossed over\n  vector w_i is determined by setting w_i =\n  (m_i with probability=crossover_prob else v_i). In addition, DE requires that\n  at least one of the components is crossed over (otherwise we end\n  up with no change). This is done by choosing an index, say k, randomly where\n  a force crossover is performed (i.e. w_k = m_k). This is the scheme\n  implemented in this function.\n\n  Args:\n    population: A Python list of `Tensor`s where each `Tensor` in the list\n      must be of rank at least 1 and all the elements must have a common\n      first dimension. The base population to cross over.\n    population_size: A scalar integer `Tensor`. The number of elements in the\n      population (i.e. size of the first dimension of any member of\n      `population`).\n    mutants: A Python list of `Tensor`s with the same structure as `population`.\n      The mutated population.\n    crossover_prob: A positive real scalar `Tensor` bounded above by 1.0. The\n      probability of a crossover being performed for each axis.\n    seed: `int` or None. The random seed for this `Op`. 
If `None`, no seed is\n      applied.\n\n  Returns:\n    A list of `Tensor`s of the same structure, dtype and shape as `population`.\n    The recombined population.\n  \"\"\"\n  sizes = [tf.cast(tf.size(input=x), dtype=tf.float64) for x in population]\n  seed_stream = distributions.SeedStream(seed, salt='binary_crossover')\n  force_crossover_group = distributions.Categorical(sizes).sample(\n      [population_size, 1], seed=seed_stream())\n  recombinants = []\n  for i, population_part in enumerate(population):\n    pop_part_flat = tf.reshape(population_part, [population_size, -1])\n    mutant_part_flat = tf.reshape(mutants[i], [population_size, -1])\n    part_size = tf.size(input=population_part) // population_size\n    force_crossovers = tf.one_hot(\n        tf.random.uniform([population_size],\n                          minval=0,\n                          maxval=part_size,\n                          dtype=tf.int32,\n                          seed=seed_stream()),\n        part_size,\n        on_value=True,\n        off_value=False,\n        dtype=tf.bool) # Tensor of shape [population_size, size]\n    group_mask = tf.math.equal(force_crossover_group, i)\n    force_crossovers &= group_mask\n    do_binary_crossover = tf.random.uniform(\n        [population_size, part_size],\n        dtype=crossover_prob.dtype.base_dtype,\n        seed=seed_stream()) < crossover_prob\n    do_binary_crossover |= force_crossovers\n    recombinant_flat = tf.where(\n        do_binary_crossover,\n        x=mutant_part_flat,\n        y=pop_part_flat)\n    recombinant = tf.reshape(recombinant_flat, tf.shape(input=population_part))\n    recombinants.append(recombinant)\n  return recombinants"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\ncompute the mutated vectors for each population member.", "response": "def _get_mutants(population,\n                 population_size,\n                 mixing_indices,\n                 differential_weight):\n  \"\"\"Computes the mutated vectors for each population member.\n\n  Args:\n    population: Python `list` of `Tensor`s representing the\n      current population vectors. Each `Tensor` must be of the same real dtype.\n      The first dimension of each `Tensor` indexes individual\n      population members. 
For example, if the population is a list with a\n single `Tensor` of shape [n, m1, m2], then `n` is the population size and\n the shape of an individual solution is [m1, m2].\n If there is more than one element in the population, then each `Tensor`\n in the list should have the first axis of the same size.\n population_size: Scalar integer `Tensor`. The size of the population.\n mixing_indices: `Tensor` of integral dtype and shape [n, 3] where `n` is the\n number of members in the population. Each element of the `Tensor` must be\n a valid index into the first dimension of the population (i.e range\n between `0` and `n-1` inclusive).\n differential_weight: Real scalar `Tensor`. Must be positive and less than\n 2.0. The parameter controlling the strength of mutation.\n\n Returns:\n mutants: `Tensor` or Python `list` of `Tensor`s of the same shape and dtype\n as the input population. The mutated vectors.\n \"\"\"\n mixing_indices = tf.reshape(mixing_indices, [-1])\n weights = tf.stack([1.0, differential_weight, -differential_weight])\n def _mutant_part(population_part):\n donors = tf.gather(population_part, mixing_indices)\n donors = tf.transpose(\n a=tf.reshape(donors, [population_size, 3, -1]), perm=[0, 2, 1])\n return tf.math.reduce_sum(input_tensor=donors * weights, axis=-1)\n\n return [_mutant_part(population_part) for population_part in population]"} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _get_mixing_indices(size, seed=None, name=None):\n with tf.compat.v1.name_scope(\n name, default_name='get_mixing_indices', values=[size]):\n size = tf.convert_to_tensor(value=size)\n dtype = size.dtype\n seed_stream = distributions.SeedStream(seed, salt='get_mixing_indices')\n first = tf.random.uniform([size],\n maxval=size-1,\n dtype=dtype,\n seed=seed_stream())\n second = tf.random.uniform([size],\n maxval=size-2,\n dtype=dtype,\n seed=seed_stream())\n third = tf.random.uniform([size],\n maxval=size-3,\n dtype=dtype,\n 
seed=seed_stream())\n\n    # Shift second if it is on top of or to the right of first\n    second = tf.where(first < second, x=second, y=second + 1)\n    smaller = tf.math.minimum(first, second)\n    larger = tf.math.maximum(first, second)\n    # Shift the third one so it does not coincide with either the first or the\n    # second number. Assuming first < second, shift by 1 if the number is in\n    # [first, second) and by 2 if the number is greater than or equal to the\n    # second.\n    third = tf.where(third < smaller, x=third, y=third + 1)\n    third = tf.where(third < larger, x=third, y=third + 1)\n    sample = tf.stack([first, second, third], axis=1)\n    to_avoid = tf.expand_dims(tf.range(size), axis=-1)\n    sample = tf.where(sample < to_avoid, x=sample, y=sample + 1)\n    return sample", "response": "Generates an array of indices suitable for a differential evolution mutation operation."} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef _ensure_list(tensor_or_list):\n  if isinstance(tensor_or_list, (list, tuple)):\n    return list(tensor_or_list), True\n  return [tensor_or_list], False", "response": "Converts the input arg to a list if it is not a list already."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef _get_tol(tol, dtype, validate_args):\n  if tol is None:\n    return tf.convert_to_tensor(value=0, dtype=dtype)\n\n  tol = tf.convert_to_tensor(value=tol, dtype=dtype)\n  if validate_args:\n    tol = distribution_util.with_dependencies([\n        assert_util.assert_non_negative(\n            tol, message=\"Argument 'tol' must be non-negative\")\n    ], tol)\n  return tol", "response": "Gets a Tensor of type dtype equal to tol, or 0 if tol is None, with optional validation."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function for\ncalculating the batched KL divergence with a Deterministic distribution.", "response": "def _kl_deterministic_distribution(a, b, name=None):\n  \"\"\"Calculate the batched KL divergence 
`KL(a || b)` with `a` Deterministic.\n\n Args:\n a: instance of a Deterministic distribution object.\n b: instance of a Distribution distribution object.\n name: (optional) Name to use for created operations. Default is\n \"kl_deterministic_distribution\".\n\n Returns:\n Batchwise `KL(a || b)`.\n \"\"\"\n with tf.name_scope(name or \"kl_deterministic_distribution\"):\n return -b.log_prob(a.loc)"} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\nclipping values to a specified min and max while leaving gradient unaltered.", "response": "def clip_by_value_preserve_gradient(t, clip_value_min, clip_value_max,\n name=None):\n \"\"\"Clips values to a specified min and max while leaving gradient unaltered.\n\n Like `tf.clip_by_value`, this function returns a tensor of the same type and\n shape as input `t` but with values clamped to be no smaller than to\n `clip_value_min` and no larger than `clip_value_max`. Unlike\n `tf.clip_by_value`, the gradient is unaffected by this op, i.e.,\n\n ```python\n tf.gradients(tfp.math.clip_by_value_preserve_gradient(x), x)[0]\n # ==> ones_like(x)\n ```\n\n Note: `clip_value_min` needs to be smaller or equal to `clip_value_max` for\n correct results.\n\n Args:\n t: A `Tensor`.\n clip_value_min: A scalar `Tensor`, or a `Tensor` with the same shape\n as `t`. The minimum value to clip by.\n clip_value_max: A scalar `Tensor`, or a `Tensor` with the same shape\n as `t`. 
The maximum value to clip by.\n name: A name for the operation (optional).\n Default value: `'clip_by_value_preserve_gradient'`.\n\n Returns:\n clipped_t: A clipped `Tensor`.\n \"\"\"\n with tf.compat.v1.name_scope(name, 'clip_by_value_preserve_gradient',\n [t, clip_value_min, clip_value_max]):\n t = tf.convert_to_tensor(value=t, name='t')\n clip_t = tf.clip_by_value(t, clip_value_min, clip_value_max)\n return t + tf.stop_gradient(clip_t - t)"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef build_input_pipeline(train_images, batch_size):\n\n training_dataset = tf.data.Dataset.from_tensor_slices(train_images)\n training_batches = training_dataset.shuffle(\n 50000, reshuffle_each_iteration=True).repeat().batch(batch_size)\n training_iterator = tf.compat.v1.data.make_one_shot_iterator(training_batches)\n images = training_iterator.get_next()\n return images", "response": "Build an iterator over training batches."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\nsave a synthetic image as a PNG file.", "response": "def plot_generated_images(images, fname):\n \"\"\"Save a synthetic image as a PNG file.\n\n Args:\n images: samples of synthetic images generated by the generative network.\n fname: Python `str`, filename to save the plot to.\n \"\"\"\n fig = plt.figure(figsize=(4, 4))\n canvas = backend_agg.FigureCanvasAgg(fig)\n\n for i, image in enumerate(images):\n ax = fig.add_subplot(4, 4, i + 1)\n plt.axis('off')\n ax.set_xticklabels([])\n ax.set_yticklabels([])\n ax.imshow(image.reshape(IMAGE_SHAPE[:-1]), cmap='Greys_r')\n\n fig.tight_layout()\n plt.subplots_adjust(wspace=0.05, hspace=0.05)\n canvas.print_figure(fname, format='png')"} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function for\nconverting a sequence of productions into a string of terminal symbols.", "response": "def convert_to_string(self, productions):\n \"\"\"Converts a sequence 
of productions into a string of terminal symbols.\n\n Args:\n productions: Tensor of shape [1, num_productions, num_production_rules].\n Slices along the `num_productions` dimension represent one-hot vectors.\n\n Returns:\n str that concatenates all terminal symbols from `productions`.\n\n Raises:\n ValueError: If the first production rule does not begin with\n `self.start_symbol`.\n \"\"\"\n symbols = []\n for production in tf.unstack(productions, axis=1):\n lhs, rhs = self.production_rules[tf.argmax(input=production, axis=-1)]\n if not symbols: # first iteration\n if lhs != self.start_symbol:\n raise ValueError(\"`productions` must begin with `self.start_symbol`.\")\n symbols = rhs\n else:\n # Greedily unroll the nonterminal symbols based on the first occurrence\n # in a linear sequence.\n index = symbols.index(lhs)\n symbols = symbols[:index] + rhs + symbols[index + 1:]\n string = \"\".join(symbols)\n return string"} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\nproduce a masking tensor for valid production rules.", "response": "def mask(self, symbol, on_value, off_value):\n \"\"\"Produces a masking tensor for (in)valid production rules.\n\n Args:\n symbol: str, a symbol in the grammar.\n on_value: Value to use for a valid production rule.\n off_value: Value to use for an invalid production rule.\n\n Returns:\n Tensor of shape [1, num_production_rules]. 
An element is `on_value`\n if its corresponding production rule has `symbol` on its left-hand-side;\n the element is `off_value` otherwise.\n \"\"\"\n mask_values = [on_value if lhs == symbol else off_value\n for lhs, _ in self.production_rules]\n mask_values = tf.reshape(mask_values, [1, len(self.production_rules)])\n return mask_values"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef call(self, inputs):\n del inputs # unused\n latent_code = ed.MultivariateNormalDiag(loc=tf.zeros(self.latent_size),\n sample_shape=1,\n name=\"latent_code\")\n state = self.lstm.zero_state(1, dtype=tf.float32)\n t = 0\n productions = []\n stack = [self.grammar.start_symbol]\n while stack:\n symbol = stack.pop()\n net, state = self.lstm(latent_code, state)\n logits = (self.output_layer(net) +\n self.grammar.mask(symbol, on_value=0., off_value=-1e9))\n production = ed.OneHotCategorical(logits=logits,\n name=\"production_\" + str(t))\n _, rhs = self.grammar.production_rules[tf.argmax(\n input=production, axis=-1)]\n for symbol in rhs:\n if symbol in self.grammar.nonterminal_symbols:\n stack.append(symbol)\n productions.append(production)\n t += 1\n return tf.stack(productions, axis=1)", "response": "Runs the model forward to generate a sequence of productions."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef call(self, inputs):\n net = self.encoder_net(tf.cast(inputs, tf.float32))\n return ed.MultivariateNormalDiag(\n loc=net[..., :self.latent_size],\n scale_diag=tf.nn.softplus(net[..., self.latent_size:]),\n name=\"latent_code_posterior\")", "response": "Runs the model forward to return a stochastic encoding."} {"SOURCE": "codesearchnet", "instruction": "Explain what the following Python 3 code does\ndef _hat_integral(self, x):\n x = tf.cast(x, self.power.dtype)\n t = self.power - 1.\n return tf.exp((-t) * tf.math.log1p(x) - tf.math.log(t))", "response": "Integral of the hat function used 
for sampling."} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _hat_integral_inverse(self, x):\n    x = tf.cast(x, self.power.dtype)\n    t = self.power - 1.\n    return tf.math.expm1(-(tf.math.log(t) + tf.math.log(x)) / t)", "response": "Inverse function of _hat_integral."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef matrix_rank(a, tol=None, validate_args=False, name=None):\n  with tf.compat.v1.name_scope(name, 'matrix_rank', [a, tol]):\n    a = tf.convert_to_tensor(value=a, dtype_hint=tf.float32, name='a')\n    assertions = _maybe_validate_matrix(a, validate_args)\n    if assertions:\n      with tf.control_dependencies(assertions):\n        a = tf.identity(a)\n    s = tf.linalg.svd(a, compute_uv=False)\n    if tol is None:\n      if a.shape[-2:].is_fully_defined():\n        m = np.max(a.shape[-2:].as_list())\n      else:\n        m = tf.reduce_max(input_tensor=tf.shape(input=a)[-2:])\n      eps = np.finfo(a.dtype.as_numpy_dtype).eps\n      tol = (eps * tf.cast(m, a.dtype) *\n             tf.reduce_max(input_tensor=s, axis=-1, keepdims=True))\n    return tf.reduce_sum(input_tensor=tf.cast(s > tol, tf.int32), axis=-1)", "response": "Compute the matrix rank; the number of non-zero SVD singular values."} {"SOURCE": "codesearchnet", "instruction": "How would you code a function in Python 3 to\nconcatenate cholesky decomposition of chol.", "response": "def cholesky_concat(chol, cols, name=None):\n  \"\"\"Concatenates `chol @ chol.T` with additional rows and columns.\n\n  This operation is conceptually identical to:\n  ```python\n  def cholesky_concat_slow(chol, cols): # cols shaped (n + m) x m = z x m\n    mat = tf.matmul(chol, chol, adjoint_b=True) # batch of n x n\n    # Concat columns.\n    mat = tf.concat([mat, cols[..., :tf.shape(mat)[-2], :]], axis=-1) # n x z\n    # Concat rows.\n    mat = tf.concat([mat, tf.linalg.matrix_transpose(cols)], axis=-2) # z x z\n    return tf.linalg.cholesky(mat)\n  ```\n  but whereas `cholesky_concat_slow` 
would cost `O(z**3)` work,\n `cholesky_concat` only costs `O(z**2 + m**3)` work.\n\n The resulting (implicit) matrix must be symmetric and positive definite.\n Thus, the bottom right `m x m` must be self-adjoint, and we do not require a\n separate `rows` argument (which can be inferred from `conj(cols.T)`).\n\n Args:\n chol: Cholesky decomposition of `mat = chol @ chol.T`.\n cols: The new columns whose first `n` rows we would like concatenated to the\n right of `mat = chol @ chol.T`, and whose conjugate transpose we would\n like concatenated to the bottom of `concat(mat, cols[:n,:])`. A `Tensor`\n with final dims `(n+m, m)`. The first `n` rows are the top right rectangle\n (their conjugate transpose forms the bottom left), and the bottom `m x m`\n is self-adjoint.\n name: Optional name for this op.\n\n Returns:\n chol_concat: The Cholesky decomposition of:\n ```\n [ [ mat cols[:n, :] ]\n [ conj(cols.T) ] ]\n ```\n \"\"\"\n with tf.compat.v2.name_scope(name or 'cholesky_extend'):\n dtype = dtype_util.common_dtype([chol, cols], preferred_dtype=tf.float32)\n chol = tf.convert_to_tensor(value=chol, name='chol', dtype=dtype)\n cols = tf.convert_to_tensor(value=cols, name='cols', dtype=dtype)\n n = prefer_static.shape(chol)[-1]\n mat_nm, mat_mm = cols[..., :n, :], cols[..., n:, :]\n solved_nm = linear_operator_util.matrix_triangular_solve_with_broadcast(\n chol, mat_nm)\n lower_right_mm = tf.linalg.cholesky(\n mat_mm - tf.matmul(solved_nm, solved_nm, adjoint_a=True))\n lower_left_mn = tf.math.conj(tf.linalg.matrix_transpose(solved_nm))\n out_batch = prefer_static.shape(solved_nm)[:-2]\n chol = tf.broadcast_to(\n chol,\n tf.concat([out_batch, prefer_static.shape(chol)[-2:]], axis=0))\n top_right_zeros_nm = tf.zeros_like(solved_nm)\n return tf.concat([tf.concat([chol, top_right_zeros_nm], axis=-1),\n tf.concat([lower_left_mn, lower_right_mm], axis=-1)],\n axis=-2)"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script for\nswapping `m` and `i` on axis -1. 
(Helper for pivoted_cholesky.) Given a batch of int64 vectors `vecs`, scalar index `m`, and compatibly shaped per-vector indices `i`, this function swaps elements `m` and `i` in each vector. For the use-case below, these are permutation vectors. Args: vecs: Vectors on which we perform the swap, int64 `Tensor`. m: Scalar int64 `Tensor`, the index into which the `i`th element is going. i: Batch int64 `Tensor`, shaped like vecs.shape[:-1] + [1]; the index into which the `m`th element is going. Returns: vecs: The updated vectors.", "response": "def _swap_m_with_i(vecs, m, i):\n \"\"\"Swaps `m` and `i` on axis -1. (Helper for pivoted_cholesky.)\n\n Given a batch of int64 vectors `vecs`, scalar index `m`, and compatibly shaped\n per-vector indices `i`, this function swaps elements `m` and `i` in each\n vector. For the use-case below, these are permutation vectors.\n\n Args:\n vecs: Vectors on which we perform the swap, int64 `Tensor`.\n m: Scalar int64 `Tensor`, the index into which the `i`th element is going.\n i: Batch int64 `Tensor`, shaped like vecs.shape[:-1] + [1]; the index into\n which the `m`th element is going.\n\n Returns:\n vecs: The updated vectors.\n \"\"\"\n vecs = tf.convert_to_tensor(value=vecs, dtype=tf.int64, name='vecs')\n m = tf.convert_to_tensor(value=m, dtype=tf.int64, name='m')\n i = tf.convert_to_tensor(value=i, dtype=tf.int64, name='i')\n trailing_elts = tf.broadcast_to(\n tf.range(m + 1,\n prefer_static.shape(vecs, out_type=tf.int64)[-1]),\n prefer_static.shape(vecs[..., m + 1:]))\n shp = prefer_static.shape(trailing_elts)\n trailing_elts = tf.where(\n tf.equal(trailing_elts, tf.broadcast_to(i, shp)),\n tf.broadcast_to(tf.gather(vecs, [m], axis=-1), shp),\n tf.broadcast_to(vecs[..., m + 1:], shp))\n # TODO(bjp): Could we use tensor_scatter_nd_update?\n vecs_shape = vecs.shape\n vecs = tf.concat([\n vecs[..., :m],\n tf.gather(vecs, i, batch_dims=prefer_static.rank(vecs) - 1), trailing_elts\n ], axis=-1)\n tensorshape_util.set_shape(vecs, 
vecs_shape)\n return vecs"} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef pivoted_cholesky(matrix, max_rank, diag_rtol=1e-3, name=None):\n with tf.compat.v2.name_scope(name or 'pivoted_cholesky'):\n dtype = dtype_util.common_dtype([matrix, diag_rtol],\n preferred_dtype=tf.float32)\n matrix = tf.convert_to_tensor(value=matrix, name='matrix', dtype=dtype)\n if tensorshape_util.rank(matrix.shape) is None:\n raise NotImplementedError('Rank of `matrix` must be known statically')\n\n max_rank = tf.convert_to_tensor(\n value=max_rank, name='max_rank', dtype=tf.int64)\n max_rank = tf.minimum(max_rank,\n prefer_static.shape(matrix, out_type=tf.int64)[-1])\n diag_rtol = tf.convert_to_tensor(\n value=diag_rtol, dtype=dtype, name='diag_rtol')\n matrix_diag = tf.linalg.diag_part(matrix)\n # matrix is P.D., therefore all matrix_diag > 0, so we don't need abs.\n orig_error = tf.reduce_max(input_tensor=matrix_diag, axis=-1)\n\n def cond(m, pchol, perm, matrix_diag):\n \"\"\"Condition for `tf.while_loop` continuation.\"\"\"\n del pchol\n del perm\n error = tf.linalg.norm(tensor=matrix_diag, ord=1, axis=-1)\n max_err = tf.reduce_max(input_tensor=error / orig_error)\n return (m < max_rank) & (tf.equal(m, 0) | (max_err > diag_rtol))\n\n batch_dims = tensorshape_util.rank(matrix.shape) - 2\n def batch_gather(params, indices, axis=-1):\n return tf.gather(params, indices, axis=axis, batch_dims=batch_dims)\n\n def body(m, pchol, perm, matrix_diag):\n \"\"\"Body of a single `tf.while_loop` iteration.\"\"\"\n # Here is roughly a numpy, non-batched version of what's going to happen.\n # (See also Algorithm 1 of Harbrecht et al.)\n # 1: maxi = np.argmax(matrix_diag[perm[m:]]) + m\n # 2: maxval = matrix_diag[perm][maxi]\n # 3: perm[m], perm[maxi] = perm[maxi], perm[m]\n # 4: row = matrix[perm[m]][perm[m + 1:]]\n # 5: row -= np.sum(pchol[:m][perm[m + 1:]] * pchol[:m][perm[m]]], axis=-2)\n # 6: pivot = np.sqrt(maxval); row 
/= pivot\n # 7: row = np.concatenate([[[pivot]], row], -1)\n # 8: matrix_diag[perm[m:]] -= row**2\n # 9: pchol[m, perm[m:]] = row\n\n # Find the maximal position of the (remaining) permuted diagonal.\n # Steps 1, 2 above.\n permuted_diag = batch_gather(matrix_diag, perm[..., m:])\n maxi = tf.argmax(\n input=permuted_diag, axis=-1, output_type=tf.int64)[..., tf.newaxis]\n maxval = batch_gather(permuted_diag, maxi)\n maxi = maxi + m\n maxval = maxval[..., 0]\n # Update perm: Swap perm[...,m] with perm[...,maxi]. Step 3 above.\n perm = _swap_m_with_i(perm, m, maxi)\n # Step 4.\n row = batch_gather(matrix, perm[..., m:m + 1], axis=-2)\n row = batch_gather(row, perm[..., m + 1:])\n # Step 5.\n prev_rows = pchol[..., :m, :]\n prev_rows_perm_m_onward = batch_gather(prev_rows, perm[..., m + 1:])\n prev_rows_pivot_col = batch_gather(prev_rows, perm[..., m:m + 1])\n row -= tf.reduce_sum(\n input_tensor=prev_rows_perm_m_onward * prev_rows_pivot_col,\n axis=-2)[..., tf.newaxis, :]\n # Step 6.\n pivot = tf.sqrt(maxval)[..., tf.newaxis, tf.newaxis]\n # Step 7.\n row = tf.concat([pivot, row / pivot], axis=-1)\n # TODO(b/130899118): Pad grad fails with int64 paddings.\n # Step 8.\n paddings = tf.concat([\n tf.zeros([prefer_static.rank(pchol) - 1, 2], dtype=tf.int32),\n [[tf.cast(m, tf.int32), 0]]], axis=0)\n diag_update = tf.pad(tensor=row**2, paddings=paddings)[..., 0, :]\n reverse_perm = _invert_permutation(perm)\n matrix_diag -= batch_gather(diag_update, reverse_perm)\n # Step 9.\n row = tf.pad(tensor=row, paddings=paddings)\n # TODO(bjp): Defer the reverse permutation all-at-once at the end?\n row = batch_gather(row, reverse_perm)\n pchol_shape = pchol.shape\n pchol = tf.concat([pchol[..., :m, :], row, pchol[..., m + 1:, :]],\n axis=-2)\n tensorshape_util.set_shape(pchol, pchol_shape)\n return m + 1, pchol, perm, matrix_diag\n\n m = np.int64(0)\n pchol = tf.zeros_like(matrix[..., :max_rank, :])\n matrix_shape = prefer_static.shape(matrix, out_type=tf.int64)\n perm = 
tf.broadcast_to(\n        prefer_static.range(matrix_shape[-1]), matrix_shape[:-1])\n    _, pchol, _, _ = tf.while_loop(\n        cond=cond, body=body, loop_vars=(m, pchol, perm, matrix_diag))\n    pchol = tf.linalg.matrix_transpose(pchol)\n    tensorshape_util.set_shape(\n        pchol, tensorshape_util.concatenate(matrix_diag.shape, [None]))\n    return pchol", "response": "Computes the (partial) pivoted Cholesky decomposition of matrix."} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef pinv(a, rcond=None, validate_args=False, name=None):\n  with tf.compat.v1.name_scope(name, 'pinv', [a, rcond]):\n    a = tf.convert_to_tensor(value=a, name='a')\n\n    assertions = _maybe_validate_matrix(a, validate_args)\n    if assertions:\n      with tf.control_dependencies(assertions):\n        a = tf.identity(a)\n\n    dtype = a.dtype.as_numpy_dtype\n\n    if rcond is None:\n      def get_dim_size(dim):\n        if tf.compat.dimension_value(a.shape[dim]) is not None:\n          return tf.compat.dimension_value(a.shape[dim])\n        return tf.shape(input=a)[dim]\n\n      num_rows = get_dim_size(-2)\n      num_cols = get_dim_size(-1)\n      if isinstance(num_rows, int) and isinstance(num_cols, int):\n        max_rows_cols = float(max(num_rows, num_cols))\n      else:\n        max_rows_cols = tf.cast(tf.maximum(num_rows, num_cols), dtype)\n      rcond = 10. * max_rows_cols * np.finfo(dtype).eps\n\n    rcond = tf.convert_to_tensor(value=rcond, dtype=dtype, name='rcond')\n\n    # Calculate pseudo inverse via SVD.\n    # Note: if a is symmetric then u == v. (We might observe additional\n    # performance by explicitly setting `v = u` in such cases.)\n    [\n        singular_values, # Sigma\n        left_singular_vectors, # U\n        right_singular_vectors, # V\n    ] = tf.linalg.svd(a, full_matrices=False, compute_uv=True)\n\n    # Saturate small singular values to inf. This has the effect of making\n    # `1. 
/ s = 0.` while not resulting in `NaN` gradients.\n    cutoff = rcond * tf.reduce_max(input_tensor=singular_values, axis=-1)\n    singular_values = tf.where(\n        singular_values > cutoff[..., tf.newaxis], singular_values,\n        tf.fill(tf.shape(input=singular_values), np.array(np.inf, dtype)))\n\n    # Although `a == tf.matmul(u, s * v, transpose_b=True)` we swap\n    # `u` and `v` here so that `tf.matmul(pinv(A), A) = tf.eye()`, i.e.,\n    # a matrix inverse has \"transposed\" semantics.\n    a_pinv = tf.matmul(\n        right_singular_vectors / singular_values[..., tf.newaxis, :],\n        left_singular_vectors,\n        adjoint_b=True)\n\n    if a.shape.ndims is not None:\n      a_pinv.set_shape(a.shape[:-2].concatenate([a.shape[-1], a.shape[-2]]))\n\n    return a_pinv", "response": "Compute the Moore-Penrose pseudo-inverse of a matrix."} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef lu_solve(lower_upper, perm, rhs,\n             validate_args=False,\n             name=None):\n  \"\"\"Solves systems of linear eqns `A X = RHS`, given LU factorizations.\n\n  Note: this function does not verify the implied matrix is actually invertible\n  nor is this condition checked even when `validate_args=True`.\n\n  Args:\n    lower_upper: `lu` as returned by `tf.linalg.lu`, i.e., if\n      `matmul(P, matmul(L, U)) = X` then `lower_upper = L + U - eye`.\n    perm: `p` as returned by `tf.linalg.lu`, i.e., if\n      `matmul(P, matmul(L, U)) = X` then `perm = argmax(P)`.\n    rhs: Matrix-shaped float `Tensor` representing targets for which to solve;\n      `A X = RHS`. To handle vector cases, use:\n      `lu_solve(..., rhs[..., tf.newaxis])[..., 0]`.\n    validate_args: Python `bool` indicating whether arguments should be checked\n      for correctness. 
Note: this function does not verify the implied matrix is\n actually invertible, even when `validate_args=True`.\n Default value: `False` (i.e., don't validate arguments).\n name: Python `str` name given to ops managed by this object.\n Default value: `None` (i.e., \"lu_solve\").\n\n Returns:\n x: The `X` in `A @ X = RHS`.\n\n #### Examples\n\n ```python\n import numpy as np\n import tensorflow as tf\n import tensorflow_probability as tfp\n\n x = [[[1., 2],\n [3, 4]],\n [[7, 8],\n [3, 4]]]\n inv_x = tfp.math.lu_solve(*tf.linalg.lu(x), rhs=tf.eye(2))\n tf.assert_near(tf.matrix_inverse(x), inv_x)\n # ==> True\n ```\n\n \"\"\"\n\n with tf.compat.v1.name_scope(name, 'lu_solve', [lower_upper, perm, rhs]):\n lower_upper = tf.convert_to_tensor(\n value=lower_upper, dtype_hint=tf.float32, name='lower_upper')\n perm = tf.convert_to_tensor(value=perm, dtype_hint=tf.int32, name='perm')\n rhs = tf.convert_to_tensor(\n value=rhs, dtype_hint=lower_upper.dtype, name='rhs')\n\n assertions = _lu_solve_assertions(lower_upper, perm, rhs, validate_args)\n if assertions:\n with tf.control_dependencies(assertions):\n lower_upper = tf.identity(lower_upper)\n perm = tf.identity(perm)\n rhs = tf.identity(rhs)\n\n if rhs.shape.ndims == 2 and perm.shape.ndims == 1:\n # Both rhs and perm have scalar batch_shape.\n permuted_rhs = tf.gather(rhs, perm, axis=-2)\n else:\n # Either rhs or perm have non-scalar batch_shape or we can't determine\n # this information statically.\n rhs_shape = tf.shape(input=rhs)\n broadcast_batch_shape = tf.broadcast_dynamic_shape(\n rhs_shape[:-2],\n tf.shape(input=perm)[:-1])\n d, m = rhs_shape[-2], rhs_shape[-1]\n rhs_broadcast_shape = tf.concat([broadcast_batch_shape, [d, m]], axis=0)\n\n # Tile out rhs.\n broadcast_rhs = tf.broadcast_to(rhs, rhs_broadcast_shape)\n broadcast_rhs = tf.reshape(broadcast_rhs, [-1, d, m])\n\n # Tile out perm and add batch indices.\n broadcast_perm = tf.broadcast_to(perm, rhs_broadcast_shape[:-1])\n broadcast_perm = 
tf.reshape(broadcast_perm, [-1, d])\n broadcast_batch_size = tf.reduce_prod(input_tensor=broadcast_batch_shape)\n broadcast_batch_indices = tf.broadcast_to(\n tf.range(broadcast_batch_size)[:, tf.newaxis],\n [broadcast_batch_size, d])\n broadcast_perm = tf.stack([broadcast_batch_indices, broadcast_perm],\n axis=-1)\n\n permuted_rhs = tf.gather_nd(broadcast_rhs, broadcast_perm)\n permuted_rhs = tf.reshape(permuted_rhs, rhs_broadcast_shape)\n\n lower = tf.linalg.set_diag(\n tf.linalg.band_part(lower_upper, num_lower=-1, num_upper=0),\n tf.ones(tf.shape(input=lower_upper)[:-1], dtype=lower_upper.dtype))\n return linear_operator_util.matrix_triangular_solve_with_broadcast(\n lower_upper, # Only upper is accessed.\n linear_operator_util.matrix_triangular_solve_with_broadcast(\n lower, permuted_rhs),\n lower=False)", "response": "Solves systems of linear eqns A X = RHS given LU factorizations."} {"SOURCE": "codesearchnet", "instruction": "Create a Python 3 function to\ncompute the matrix inverse given the matrix's LU decomposition.", "response": "def lu_matrix_inverse(lower_upper, perm, validate_args=False, name=None):\n \"\"\"Computes a matrix inverse given the matrix's LU decomposition.\n\n This op is conceptually identical to,\n\n ```python\n inv_X = tf.lu_matrix_inverse(*tf.linalg.lu(X))\n tf.assert_near(tf.matrix_inverse(X), inv_X)\n # ==> True\n ```\n\n Note: this function does not verify the implied matrix is actually invertible\n nor is this condition checked even when `validate_args=True`.\n\n Args:\n lower_upper: `lu` as returned by `tf.linalg.lu`, i.e., if\n `matmul(P, matmul(L, U)) = X` then `lower_upper = L + U - eye`.\n perm: `p` as returned by `tf.linalg.lu`, i.e., if\n `matmul(P, matmul(L, U)) = X` then `perm = argmax(P)`.\n validate_args: Python `bool` indicating whether arguments should be checked\n for correctness. 
Note: this function does not verify the implied matrix is\n actually invertible, even when `validate_args=True`.\n Default value: `False` (i.e., don't validate arguments).\n name: Python `str` name given to ops managed by this object.\n Default value: `None` (i.e., \"lu_matrix_inverse\").\n\n Returns:\n inv_x: The matrix_inv, i.e.,\n `tf.matrix_inverse(tfp.math.lu_reconstruct(lu, perm))`.\n\n #### Examples\n\n ```python\n import numpy as np\n import tensorflow as tf\n import tensorflow_probability as tfp\n\n x = [[[3., 4], [1, 2]],\n [[7., 8], [3, 4]]]\n inv_x = tfp.math.lu_matrix_inverse(*tf.linalg.lu(x))\n tf.assert_near(tf.matrix_inverse(x), inv_x)\n # ==> True\n ```\n\n \"\"\"\n\n with tf.compat.v1.name_scope(name, 'lu_matrix_inverse', [lower_upper, perm]):\n lower_upper = tf.convert_to_tensor(\n value=lower_upper, dtype_hint=tf.float32, name='lower_upper')\n perm = tf.convert_to_tensor(value=perm, dtype_hint=tf.int32, name='perm')\n assertions = _lu_reconstruct_assertions(lower_upper, perm, validate_args)\n if assertions:\n with tf.control_dependencies(assertions):\n lower_upper = tf.identity(lower_upper)\n perm = tf.identity(perm)\n shape = tf.shape(input=lower_upper)\n return lu_solve(\n lower_upper, perm,\n rhs=tf.eye(shape[-1], batch_shape=shape[:-2], dtype=lower_upper.dtype),\n validate_args=False)"} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef lu_reconstruct(lower_upper, perm, validate_args=False, name=None):\n with tf.compat.v1.name_scope(name, 'lu_reconstruct', [lower_upper, perm]):\n lower_upper = tf.convert_to_tensor(\n value=lower_upper, dtype_hint=tf.float32, name='lower_upper')\n perm = tf.convert_to_tensor(value=perm, dtype_hint=tf.int32, name='perm')\n\n assertions = _lu_reconstruct_assertions(lower_upper, perm, validate_args)\n if assertions:\n with tf.control_dependencies(assertions):\n lower_upper = tf.identity(lower_upper)\n perm = tf.identity(perm)\n\n shape = tf.shape(input=lower_upper)\n\n 
lower = tf.linalg.set_diag(\n tf.linalg.band_part(lower_upper, num_lower=-1, num_upper=0),\n tf.ones(shape[:-1], dtype=lower_upper.dtype))\n upper = tf.linalg.band_part(lower_upper, num_lower=0, num_upper=-1)\n x = tf.matmul(lower, upper)\n\n if lower_upper.shape.ndims is None or lower_upper.shape.ndims != 2:\n # We either don't know the batch rank or there are >0 batch dims.\n batch_size = tf.reduce_prod(input_tensor=shape[:-2])\n d = shape[-1]\n x = tf.reshape(x, [batch_size, d, d])\n perm = tf.reshape(perm, [batch_size, d])\n perm = tf.map_fn(tf.math.invert_permutation, perm)\n batch_indices = tf.broadcast_to(\n tf.range(batch_size)[:, tf.newaxis],\n [batch_size, d])\n x = tf.gather_nd(x, tf.stack([batch_indices, perm], axis=-1))\n x = tf.reshape(x, shape)\n else:\n x = tf.gather(x, tf.math.invert_permutation(perm))\n\n x.set_shape(lower_upper.shape)\n return x", "response": "The inverse LU decomposition of X."} {"SOURCE": "codesearchnet", "instruction": "Can you write a function in Python 3 where it\nreturns a list of assertions related to lu_reconstruct assumptions.", "response": "def _lu_reconstruct_assertions(lower_upper, perm, validate_args):\n \"\"\"Returns list of assertions related to `lu_reconstruct` assumptions.\"\"\"\n assertions = []\n\n message = 'Input `lower_upper` must have at least 2 dimensions.'\n if lower_upper.shape.ndims is not None:\n if lower_upper.shape.ndims < 2:\n raise ValueError(message)\n elif validate_args:\n assertions.append(\n tf.compat.v1.assert_rank_at_least(lower_upper, rank=2, message=message))\n\n message = '`rank(lower_upper)` must equal `rank(perm) + 1`'\n if lower_upper.shape.ndims is not None and perm.shape.ndims is not None:\n if lower_upper.shape.ndims != perm.shape.ndims + 1:\n raise ValueError(message)\n elif validate_args:\n assertions.append(\n tf.compat.v1.assert_rank(\n lower_upper, rank=tf.rank(perm) + 1, message=message))\n\n message = '`lower_upper` must be square.'\n if 
lower_upper.shape[-2:].is_fully_defined():\n if lower_upper.shape[-2] != lower_upper.shape[-1]:\n raise ValueError(message)\n elif validate_args:\n m, n = tf.split(tf.shape(input=lower_upper)[-2:], num_or_size_splits=2)\n assertions.append(tf.compat.v1.assert_equal(m, n, message=message))\n\n return assertions"} {"SOURCE": "codesearchnet", "instruction": "Here you have a function in Python 3, explain what it does\ndef _lu_solve_assertions(lower_upper, perm, rhs, validate_args):\n assertions = _lu_reconstruct_assertions(lower_upper, perm, validate_args)\n\n message = 'Input `rhs` must have at least 2 dimensions.'\n if rhs.shape.ndims is not None:\n if rhs.shape.ndims < 2:\n raise ValueError(message)\n elif validate_args:\n assertions.append(\n tf.compat.v1.assert_rank_at_least(rhs, rank=2, message=message))\n\n message = '`lower_upper.shape[-1]` must equal `rhs.shape[-1]`.'\n if (tf.compat.dimension_value(lower_upper.shape[-1]) is not None and\n tf.compat.dimension_value(rhs.shape[-2]) is not None):\n if lower_upper.shape[-1] != rhs.shape[-2]:\n raise ValueError(message)\n elif validate_args:\n assertions.append(\n tf.compat.v1.assert_equal(\n tf.shape(input=lower_upper)[-1],\n tf.shape(input=rhs)[-2],\n message=message))\n\n return assertions", "response": "Returns list of assertions related to lu_solve assumptions."} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 script to\nreturn the (batched) matmul of a SparseTensor (or Tensor) with a Tensor.", "response": "def sparse_or_dense_matmul(sparse_or_dense_a,\n dense_b,\n validate_args=False,\n name=None,\n **kwargs):\n \"\"\"Returns (batched) matmul of a SparseTensor (or Tensor) with a Tensor.\n\n Args:\n sparse_or_dense_a: `SparseTensor` or `Tensor` representing a (batch of)\n matrices.\n dense_b: `Tensor` representing a (batch of) matrices, with the same batch\n shape as `sparse_or_dense_a`. 
The shape must be compatible with the shape\n of `sparse_or_dense_a` and kwargs.\n validate_args: When `True`, additional assertions might be embedded in the\n graph.\n Default value: `False` (i.e., no graph assertions are added).\n name: Python `str` prefixed to ops created by this function.\n Default value: \"sparse_or_dense_matmul\".\n **kwargs: Keyword arguments to `tf.sparse_tensor_dense_matmul` or\n `tf.matmul`.\n\n Returns:\n product: A dense (batch of) matrix-shaped Tensor of the same batch shape and\n dtype as `sparse_or_dense_a` and `dense_b`. If `sparse_or_dense_a` or\n `dense_b` is adjointed through `kwargs` then the shape is adjusted\n accordingly.\n \"\"\"\n with tf.compat.v1.name_scope(name, 'sparse_or_dense_matmul',\n [sparse_or_dense_a, dense_b]):\n dense_b = tf.convert_to_tensor(\n value=dense_b, dtype_hint=tf.float32, name='dense_b')\n\n if validate_args:\n assert_a_rank_at_least_2 = tf.compat.v1.assert_rank_at_least(\n sparse_or_dense_a,\n rank=2,\n message='Input `sparse_or_dense_a` must have at least 2 dimensions.')\n assert_b_rank_at_least_2 = tf.compat.v1.assert_rank_at_least(\n dense_b,\n rank=2,\n message='Input `dense_b` must have at least 2 dimensions.')\n with tf.control_dependencies(\n [assert_a_rank_at_least_2, assert_b_rank_at_least_2]):\n sparse_or_dense_a = tf.identity(sparse_or_dense_a)\n dense_b = tf.identity(dense_b)\n\n if isinstance(sparse_or_dense_a,\n (tf.SparseTensor, tf.compat.v1.SparseTensorValue)):\n return _sparse_tensor_dense_matmul(sparse_or_dense_a, dense_b, **kwargs)\n else:\n return tf.matmul(sparse_or_dense_a, dense_b, **kwargs)"} {"SOURCE": "codesearchnet", "instruction": "How would you implement a function in Python 3 that\nreturns a Tensor of the same shape as sparse_or_dense_matrix and dense_vector.", "response": "def sparse_or_dense_matvecmul(sparse_or_dense_matrix,\n dense_vector,\n validate_args=False,\n name=None,\n **kwargs):\n \"\"\"Returns (batched) matmul of a (sparse) matrix with a column vector.\n\n 
Args:\n sparse_or_dense_matrix: `SparseTensor` or `Tensor` representing a (batch of)\n matrices.\n dense_vector: `Tensor` representing a (batch of) vectors, with the same\n batch shape as `sparse_or_dense_matrix`. The shape must be compatible with\n the shape of `sparse_or_dense_matrix` and kwargs.\n validate_args: When `True`, additional assertions might be embedded in the\n graph.\n Default value: `False` (i.e., no graph assertions are added).\n name: Python `str` prefixed to ops created by this function.\n Default value: \"sparse_or_dense_matvecmul\".\n **kwargs: Keyword arguments to `tf.sparse_tensor_dense_matmul` or\n `tf.matmul`.\n\n Returns:\n product: A dense (batch of) vector-shaped Tensor of the same batch shape and\n dtype as `sparse_or_dense_matrix` and `dense_vector`.\n \"\"\"\n with tf.compat.v1.name_scope(name, 'sparse_or_dense_matvecmul',\n [sparse_or_dense_matrix, dense_vector]):\n dense_vector = tf.convert_to_tensor(\n value=dense_vector, dtype_hint=tf.float32, name='dense_vector')\n return tf.squeeze(\n sparse_or_dense_matmul(\n sparse_or_dense_matrix,\n dense_vector[..., tf.newaxis],\n validate_args=validate_args,\n **kwargs),\n axis=[-1])"} {"SOURCE": "codesearchnet", "instruction": "Write a Python 3 function that can\nreturn a dense matrix - shaped Tensor with a Tensor.", "response": "def _sparse_tensor_dense_matmul(sp_a, b, **kwargs):\n \"\"\"Returns (batched) matmul of a SparseTensor with a Tensor.\n\n Args:\n sp_a: `SparseTensor` representing a (batch of) matrices.\n b: `Tensor` representing a (batch of) matrices, with the same batch shape of\n `sp_a`. The shape must be compatible with the shape of `sp_a` and kwargs.\n **kwargs: Keyword arguments to `tf.sparse_tensor_dense_matmul`.\n\n Returns:\n product: A dense (batch of) matrix-shaped Tensor of the same batch shape and\n dtype as `sp_a` and `b`. 
If `sp_a` or `b` is adjointed through `kwargs` then\n the shape is adjusted accordingly.\n \"\"\"\n batch_shape = _get_shape(sp_a)[:-2]\n\n # Reshape the SparseTensor into a rank 3 SparseTensors, with the\n # batch shape flattened to a single dimension. If the batch rank is 0, then\n # we add a batch dimension of rank 1.\n sp_a = tf.sparse.reshape(sp_a, tf.concat([[-1], _get_shape(sp_a)[-2:]],\n axis=0))\n # Reshape b to stack the batch dimension along the rows.\n b = tf.reshape(b, tf.concat([[-1], _get_shape(b)[-1:]], axis=0))\n\n # Convert the SparseTensor to a matrix in block diagonal form with blocks of\n # matrices [M, N]. This allow us to use tf.sparse_tensor_dense_matmul which\n # only accepts rank 2 (Sparse)Tensors.\n out = tf.sparse.sparse_dense_matmul(_sparse_block_diag(sp_a), b, **kwargs)\n\n # Finally retrieve the original batch shape from the resulting rank 2 Tensor.\n # Note that we avoid inferring the final shape from `sp_a` or `b` because we\n # might have transposed one or both of them.\n return tf.reshape(\n out,\n tf.concat([batch_shape, [-1], _get_shape(out)[-1:]], axis=0))"} {"SOURCE": "codesearchnet", "instruction": "Implement a function in Python 3 to\nreturn a SparseTensor from a batch of SparseTensors.", "response": "def _sparse_block_diag(sp_a):\n \"\"\"Returns a block diagonal rank 2 SparseTensor from a batch of SparseTensors.\n\n Args:\n sp_a: A rank 3 `SparseTensor` representing a batch of matrices.\n\n Returns:\n sp_block_diag_a: matrix-shaped, `float` `SparseTensor` with the same dtype\n as `sparse_or_matrix`, of shape [B * M, B * N] where `sp_a` has shape\n [B, M, N]. Each [M, N] batch of `sp_a` is lined up along the diagonal.\n \"\"\"\n # Construct the matrix [[M, N], [1, 0], [0, 1]] which would map the index\n # (b, i, j) to (Mb + i, Nb + j). 
This effectively creates a block-diagonal\n # matrix of dense shape [B * M, B * N].\n # Note that this transformation doesn't increase the number of non-zero\n # entries in the SparseTensor.\n sp_a_shape = tf.convert_to_tensor(value=_get_shape(sp_a, tf.int64))\n ind_mat = tf.concat([[sp_a_shape[-2:]], tf.eye(2, dtype=tf.int64)], axis=0)\n indices = tf.matmul(sp_a.indices, ind_mat)\n dense_shape = sp_a_shape[0] * sp_a_shape[1:]\n return tf.SparseTensor(\n indices=indices, values=sp_a.values, dense_shape=dense_shape)"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _maybe_validate_matrix(a, validate_args):\n assertions = []\n if not a.dtype.is_floating:\n raise TypeError('Input `a` must have `float`-like `dtype` '\n '(saw {}).'.format(a.dtype.name))\n if a.shape.ndims is not None:\n if a.shape.ndims < 2:\n raise ValueError('Input `a` must have at least 2 dimensions '\n '(saw: {}).'.format(a.shape.ndims))\n elif validate_args:\n assertions.append(tf.compat.v1.assert_rank_at_least(\n a, rank=2, message='Input `a` must have at least 2 dimensions.'))\n return assertions", "response": "Checks that input is a float matrix."} {"SOURCE": "codesearchnet", "instruction": "Can you generate the documentation for the following Python 3 function\ndef _grad_neg_log_likelihood_and_fim(model_matrix, linear_response, response,\n model):\n \"\"\"Computes the neg-log-likelihood gradient and Fisher information for a GLM.\n\n Note that Fisher information is related to the Hessian of the log-likelihood\n by the equation\n\n ```none\n FisherInfo = E[Hessian with respect to model_coefficients of -LogLikelihood(\n Y | model_matrix, model_coefficients)]\n ```\n\n where `LogLikelihood` is the log-likelihood of a generalized linear model\n parameterized by `model_matrix` and `model_coefficients`, and the expectation\n is taken over Y, distributed according to the same GLM with the same parameter\n values.\n\n Args:\n model_matrix: 
(Batch of) matrix-shaped, `float` `Tensor` or `SparseTensor`\n where each row represents a sample's features. Has shape `[N, n]` where\n `N` is the number of data samples and `n` is the number of features per\n sample.\n linear_response: (Batch of) vector-shaped `Tensor` with the same dtype as\n `model_matrix`, equal to `model_matrix @ model_coefficients` where\n `model_coefficients` are the coefficients of the linear component of the\n GLM.\n response: (Batch of) vector-shaped `Tensor` with the same dtype as\n `model_matrix` where each element represents a sample's observed response\n (to the corresponding row of features).\n model: `tfp.glm.ExponentialFamily`-like instance, which specifies the link\n function and distribution of the GLM, and thus characterizes the negative\n log-likelihood. Must have sufficient statistic equal to the response, that\n is, `T(y) = y`.\n\n Returns:\n grad_neg_log_likelihood: (Batch of) vector-shaped `Tensor` with the same\n shape and dtype as a single row of `model_matrix`, representing the\n gradient of the negative log likelihood of `response` given linear\n response `linear_response`.\n fim_middle: (Batch of) vector-shaped `Tensor` with the same shape and dtype\n as a single column of `model_matrix`, satisfying the equation\n `Fisher information =\n Transpose(model_matrix)\n @ diag(fim_middle)\n @ model_matrix`.\n \"\"\"\n # TODO(b/111926503): Determine whether there are some practical cases where it\n # is computationally favorable to compute the full FIM.\n mean, variance, grad_mean = model(linear_response)\n\n is_valid = (\n tf.math.is_finite(grad_mean) & tf.not_equal(grad_mean, 0.)\n & tf.math.is_finite(variance) & (variance > 0.))\n\n def _mask_if_invalid(x, mask):\n mask = tf.fill(\n tf.shape(input=x), value=np.array(mask, x.dtype.as_numpy_dtype))\n return tf.where(is_valid, x, mask)\n\n # TODO(b/111923449): Link to derivation once it's available.\n v = (response - mean) * _mask_if_invalid(grad_mean, 1) / _mask_if_invalid(\n 
variance, np.inf)\n grad_log_likelihood = sparse_or_dense_matvecmul(\n model_matrix, v, adjoint_a=True)\n fim_middle = _mask_if_invalid(grad_mean, 0.)**2 / _mask_if_invalid(\n variance, np.inf)\n return -grad_log_likelihood, fim_middle", "response": "Returns the gradient of the neg-log-likelihood of a generalized linear model and the Fisher information for a GLM."} {"SOURCE": "codesearchnet", "instruction": "Can you create a Python 3 function that\ngenerates the slices for building an autoregressive mask.", "response": "def _gen_slices(num_blocks, n_in, n_out, mask_type=MASK_EXCLUSIVE):\n \"\"\"Generate the slices for building an autoregressive mask.\"\"\"\n # TODO(b/67594795): Better support of dynamic shape.\n slices = []\n col = 0\n d_in = n_in // num_blocks\n d_out = n_out // num_blocks\n row = d_out if mask_type == MASK_EXCLUSIVE else 0\n for _ in range(num_blocks):\n row_slice = slice(row, None)\n col_slice = slice(col, col + d_in)\n slices.append([row_slice, col_slice])\n col += d_in\n row += d_out\n return slices"} {"SOURCE": "codesearchnet", "instruction": "Given the following Python 3 function, write the documentation\ndef _gen_mask(num_blocks,\n n_in,\n n_out,\n mask_type=MASK_EXCLUSIVE,\n dtype=tf.float32):\n \"\"\"Generate the mask for building an autoregressive dense layer.\"\"\"\n # TODO(b/67594795): Better support of dynamic shape.\n mask = np.zeros([n_out, n_in], dtype=dtype.as_numpy_dtype())\n slices = _gen_slices(num_blocks, n_in, n_out, mask_type=mask_type)\n for [row_slice, col_slice] in slices:\n mask[row_slice, col_slice] = 1\n return mask", "response": "Generate the mask for building an autoregressive dense layer."} {"SOURCE": "codesearchnet", "instruction": "Implement a Python 3 function for\nbuilding the Masked Autoregressive Density Estimator (Germain et al., 2015). This will be wrapped in a make_template to ensure the variables are only created once. It takes the input and returns the `loc` (\"mu\" in [Germain et al. 
(2015)][1]) and `log_scale` (\"alpha\" in [Germain et al. (2015)][1]) from the MADE network. Warning: This function uses `masked_dense` to create randomly initialized `tf.Variables`. It is presumed that these will be fit, just as you would any other neural architecture which uses `tf.layers.dense`. #### About Hidden Layers Each element of `hidden_layers` should be greater than the `input_depth` (i.e., `input_depth = tf.shape(input)[-1]` where `input` is the input to the neural network). This is necessary to ensure the autoregressivity property. #### About Clipping This function also optionally clips the `log_scale` (but possibly not its gradient). This is useful because if `log_scale` is too small/large it might underflow/overflow making it impossible for the `MaskedAutoregressiveFlow` bijector to implement a bijection. Additionally, the `log_scale_clip_gradient` `bool` indicates whether the gradient should also be clipped. The default does not clip the gradient; this is useful because it still provides gradient information (for fitting) yet solves the numerical stability problem. I.e., `log_scale_clip_gradient = False` means `grad[exp(clip(x))] = grad[x] exp(clip(x))` rather than the usual `grad[clip(x)] exp(clip(x))`. Args: hidden_layers: Python `list`-like of non-negative integer, scalars indicating the number of units in each hidden layer. Default: `[512, 512]. shift_only: Python `bool` indicating if only the `shift` term shall be computed. Default: `False`. activation: Activation function (callable). Explicitly setting to `None` implies a linear activation. log_scale_min_clip: `float`-like scalar `Tensor`, or a `Tensor` with the same shape as `log_scale`. The minimum value to clip by. Default: -5. log_scale_max_clip: `float`-like scalar `Tensor`, or a `Tensor` with the same shape as `log_scale`. The maximum value to clip by. Default: 3. log_scale_clip_gradient: Python `bool` indicating that the gradient of `tf.clip_by_value` should be preserved. 
Default: `False`. name: A name for ops managed by this function. Default: \"masked_autoregressive_default_template\". *args: `tf.layers.dense` arguments. **kwargs: `tf.layers.dense` keyword arguments. Returns: shift: `Float`-like `Tensor` of shift terms (the \"mu\" in [Germain et al. (2015)][1]). log_scale: `Float`-like `Tensor` of log(scale) terms (the \"alpha\" in [Germain et al. (2015)][1]). Raises: NotImplementedError: if rightmost dimension of `inputs` is unknown prior to graph execution. #### References [1]: Mathieu Germain, Karol Gregor, Iain Murray, and Hugo Larochelle. MADE: Masked Autoencoder for Distribution Estimation. In _International Conference on Machine Learning_, 2015. https://arxiv.org/abs/1502.03509", "response": "def masked_autoregressive_default_template(hidden_layers,\n shift_only=False,\n activation=tf.nn.relu,\n log_scale_min_clip=-5.,\n log_scale_max_clip=3.,\n log_scale_clip_gradient=False,\n name=None,\n *args, # pylint: disable=keyword-arg-before-vararg\n **kwargs):\n \"\"\"Build the Masked Autoregressive Density Estimator (Germain et al., 2015).\n\n This will be wrapped in a make_template to ensure the variables are only\n created once. It takes the input and returns the `loc` (\"mu\" in [Germain et\n al. (2015)][1]) and `log_scale` (\"alpha\" in [Germain et al. (2015)][1]) from\n the MADE network.\n\n Warning: This function uses `masked_dense` to create randomly initialized\n `tf.Variables`. It is presumed that these will be fit, just as you would any\n other neural architecture which uses `tf.layers.dense`.\n\n #### About Hidden Layers\n\n Each element of `hidden_layers` should be greater than the `input_depth`\n (i.e., `input_depth = tf.shape(input)[-1]` where `input` is the input to the\n neural network). This is necessary to ensure the autoregressivity property.\n\n #### About Clipping\n\n This function also optionally clips the `log_scale` (but possibly not its\n gradient). 
This is useful because if `log_scale` is too small/large it might\n underflow/overflow making it impossible for the `MaskedAutoregressiveFlow`\n bijector to implement a bijection. Additionally, the `log_scale_clip_gradient`\n `bool` indicates whether the gradient should also be clipped. The default does\n not clip the gradient; this is useful because it still provides gradient\n information (for fitting) yet solves the numerical stability problem. I.e.,\n `log_scale_clip_gradient = False` means\n `grad[exp(clip(x))] = grad[x] exp(clip(x))` rather than the usual\n `grad[clip(x)] exp(clip(x))`.\n\n Args:\n hidden_layers: Python `list`-like of non-negative integer, scalars\n indicating the number of units in each hidden layer. Default: `[512, 512].\n shift_only: Python `bool` indicating if only the `shift` term shall be\n computed. Default: `False`.\n activation: Activation function (callable). Explicitly setting to `None`\n implies a linear activation.\n log_scale_min_clip: `float`-like scalar `Tensor`, or a `Tensor` with the\n same shape as `log_scale`. The minimum value to clip by. Default: -5.\n log_scale_max_clip: `float`-like scalar `Tensor`, or a `Tensor` with the\n same shape as `log_scale`. The maximum value to clip by. Default: 3.\n log_scale_clip_gradient: Python `bool` indicating that the gradient of\n `tf.clip_by_value` should be preserved. Default: `False`.\n name: A name for ops managed by this function. Default:\n \"masked_autoregressive_default_template\".\n *args: `tf.layers.dense` arguments.\n **kwargs: `tf.layers.dense` keyword arguments.\n\n Returns:\n shift: `Float`-like `Tensor` of shift terms (the \"mu\" in\n [Germain et al. (2015)][1]).\n log_scale: `Float`-like `Tensor` of log(scale) terms (the \"alpha\" in\n [Germain et al. (2015)][1]).\n\n Raises:\n NotImplementedError: if rightmost dimension of `inputs` is unknown prior to\n graph execution.\n\n #### References\n\n [1]: Mathieu Germain, Karol Gregor, Iain Murray, and Hugo Larochelle. 
MADE:\n Masked Autoencoder for Distribution Estimation. In _International\n Conference on Machine Learning_, 2015. https://arxiv.org/abs/1502.03509\n \"\"\"\n name = name or \"masked_autoregressive_default_template\"\n with tf.compat.v2.name_scope(name):\n\n def _fn(x):\n \"\"\"MADE parameterized via `masked_autoregressive_default_template`.\"\"\"\n # TODO(b/67594795): Better support of dynamic shape.\n input_depth = tf.compat.dimension_value(\n tensorshape_util.with_rank_at_least(x.shape, 1)[-1])\n if input_depth is None:\n raise NotImplementedError(\n \"Rightmost dimension must be known prior to graph execution.\")\n input_shape = (\n np.int32(tensorshape_util.as_list(x.shape))\n if tensorshape_util.is_fully_defined(x.shape) else tf.shape(input=x))\n if tensorshape_util.rank(x.shape) == 1:\n x = x[tf.newaxis, ...]\n for i, units in enumerate(hidden_layers):\n x = masked_dense(\n inputs=x,\n units=units,\n num_blocks=input_depth,\n exclusive=True if i == 0 else False,\n activation=activation,\n *args, # pylint: disable=keyword-arg-before-vararg\n **kwargs)\n x = masked_dense(\n inputs=x,\n units=(1 if shift_only else 2) * input_depth,\n num_blocks=input_depth,\n activation=None,\n *args, # pylint: disable=keyword-arg-before-vararg\n **kwargs)\n if shift_only:\n x = tf.reshape(x, shape=input_shape)\n return x, None\n x = tf.reshape(x, shape=tf.concat([input_shape, [2]], axis=0))\n shift, log_scale = tf.unstack(x, num=2, axis=-1)\n which_clip = (\n tf.clip_by_value\n if log_scale_clip_gradient else clip_by_value_preserve_gradient)\n log_scale = which_clip(log_scale, log_scale_min_clip, log_scale_max_clip)\n return shift, log_scale\n\n return tf.compat.v1.make_template(name, _fn)"} {"SOURCE": "codesearchnet", "instruction": "Can you tell what is the following Python 3 function doing\ndef _create_input_order(input_size, input_order=\"left-to-right\"):\n if isinstance(input_order, six.string_types):\n if input_order == \"left-to-right\":\n return np.arange(start=1, 
stop=input_size + 1)\n elif input_order == \"right-to-left\":\n return np.arange(start=input_size, stop=0, step=-1)\n elif input_order == \"random\":\n ret = np.arange(start=1, stop=input_size + 1)\n np.random.shuffle(ret)\n return ret\n elif np.all(np.sort(input_order) == np.arange(1, input_size + 1)):\n return np.array(input_order)\n\n raise ValueError(\"Invalid input order: '{}'.\".format(input_order))", "response": "Creates a degree vector for the input."} {"SOURCE": "codesearchnet", "instruction": "Make a summary of the following Python 3 code\ndef _create_degrees(input_size,\n hidden_units=None,\n input_order=\"left-to-right\",\n hidden_degrees=\"equal\"):\n \"\"\"Returns a list of degree vectors, one for each input and hidden layer.\n\n A unit with degree d can only receive input from units with degree < d. Output\n units always have the same degree as their associated input unit.\n\n Args:\n input_size: Number of inputs.\n hidden_units: list with the number of hidden units per layer. It does not\n include the output layer. Each hidden unit size must be at least the size\n of length (otherwise autoregressivity is not possible).\n input_order: Order of degrees to the input units: 'random', 'left-to-right',\n 'right-to-left', or an array of an explicit order. For example,\n 'left-to-right' builds an autoregressive model\n p(x) = p(x1) p(x2 | x1) ... p(xD | x