EX05 - Dictionary Functions

In this exercise you will get some practice with some dictionary functions.

Assignment Outline

  • invert – 20 Points Autograded
    • Unit Tests – 5 Points Autograded
  • count – 20 Points Autograded
    • Unit Tests – 5 Points Autograded
  • favorite_colors – 25 Points Autograded
    • Unit Tests – 5 Points Autograded
  • bin_len – 25 Points Autograded
    • Unit Tests – 5 Points Autograded
  • Typing – 10 Points Autograded

Note: Even if your functions are not 100% correct or finished, you can get full credit for the unit tests if you set up a function skeleton and write your tests assuming correct functionality.

0. Setup

Create a new directory in exercises named ex05.

Inside the exercises/ex05 directory, create a file named dictionary.py. Add a docstring and establish an __author__ variable to be assigned a string with the digits of your PID. This is where you will implement your function skeletons and implementations below.

Unit Tests

Also inside the exercises/ex05 directory, create a file named dictionary_test.py. Add a docstring and establish an __author__ in this file as well.

For each function from below (invert, count, favorite_color, bin_len), you are to define at least 3x unit test functions. Remember that a unit test function starts with test_.

The 3 unit tests should consist of:

  • One edge case
  • Two use cases

Include descriptive function names and docstrings, so that it captures what is being tested.

The command to run your tests is python -m pytest exercises/ex05 or you can run them using the beaker tab in VSCode if it is working (do note the VSCode testing feature tends to be a bit flaky).

Just like in ex04, you will need to import your functions in your tests file.

If your screen is large enough, you are encouraged to open these files side-by-side in VSCode by dragging the tab of one to the right side of VSCode so that it changes to a split pane view. Closing your file explorer can help give you additional horizontal space.

1. invert

This is the first function you will write in dictionary.py. The other two functions will also be defined in this file.

Given a dictionary of [str, str], invert should return a dict[str, str] that inverts the keys and the values. The keys of the input list becomes the values of the output list and vice versa.

Remember that keys in a dictionary are unique. If you encounter more than one of the same key when trying to invert your dictionary, i.e., raise a KeyError. Example usage:

>>> invert({'a': 'z', 'b' : 'y', 'c': 'x'})
{'z': 'a', 'y': 'b', 'x': 'c'}
>>> invert({'apple': 'cat'})
{'cat': 'apple'}
>>> invert({'kris': 'jordan', 'michael': 'jordan'})
KeyError

The syntax for raising a KeyError is: `raise KeyError(“error message of your choice here!”)

Testing for KeyError:

You may optionally add a test for the invert function to check if the KeyError is being raised. Testing errors requires different syntax than testing normal outputs. If an error is raised then the function never actually returned anything, so we are unable to assert a return value. Instead try the following:

  1. Make sure to import pytest
  2. Use the following code as an example:
    with pytest.raises(KeyError):
        my_dictionary = {'kris': 'jordan', 'michael': 'jordan'}
        invert(my_dictionary)

2. count

Given a list[str], this function will produce a dict[str, int] where each key is a unique value in the given list and each value associated is the count of the number of times that value appeared in the input list.

  • Function name: count
  • Parameter: list[str] - list of values to count the frequencies of
  • Return Type: dict[str, int] - a dictionary of the counts of each of the items in the input list

Implementation strategy:

  1. Establish an empty dictionary to store your built-up result in
  2. Loop through each item in the input list
    1. Check to see if that item has already been established as a key in your dictionary. Try the following boolean conditional: if <item> in <dict>: – replacing <item> with the variable name of the current value and <dict> with the name of your result dictionary.
    2. If the item is found in the dict, that means there is already a key/value pair where the item is a key. Increase the value associated with that key by 1 (counting it!)
    3. If the item is not found in the dict, that means this is the first time you are encountering the value and should assign an initial count of 1 to that key in the result dictionary.
  3. Return the resulting dictionary.

3. favorite_color

Create a function in your dictionary.py file called favorite_color. It has the following specifications:

  1. It has one parameter, of type dict[str, str] of names and favorite colors.
  2. It returns a str which is the color that appears most frequently.
    1. If there is a tie for most popular color, return the first color encountered.

You should consider whether you can make use of your correctly implemented count function in the implementation of this function and challenge yourself to try to.

You can check the correctness of your favorite_color function by writing and running tests for it first.

4. bin_len

Write a function named bin_len that “bins” a list of strings into a dictionary where the key is an int length of a given string and the associated values are a set of strings of the key’s length found in the original list.

For example:

assert bin_len(["the", "quick", "fox"]) == {3: {"the", "fox"}, 5: {"quick"}}
assert bin_len(["the", "the", "fox"]) == {3: {"the", "fox"}}

You should write your skeleton function implementation and runnable tests before continuing on. It is a challenge left to you to define the correct parameter type and return type based on the problem description.

4. Submit to Gradescope for Grading

Login to Gradescope and select the assignment named “EX05 - Dictionary Functions.”. You’ll see an area to upload a zip file. To produce a zip file for autograding, return back to Visual Studio Code.

If you do not see a Terminal at the bottom of your screen, open the Command Palette and search for “View: Toggle Integrated Terminal”.

Type the following command (all on a single line):

python -m tools.submission exercises/ex05

In the file explorer pane, look to find the zip file named “yy.mm.dd-hh.mm-exercises-ex05.zip”. The “yy”, “mm”, “dd”, and so on, are timestamps with the current month, day, hour, minute. If you right click on this file and select “Reveal in File Explorer” on Windows or “Reveal in Finder” on Mac, the zip file’s location on your computer will open. Upload this file to Gradescope to submit your work for this exercise.

Autograding will take a few moments to complete. If there are issues reported, you are encouraged to try and resolve them and resubmit. If for any reason you aren’t receiving full credit and aren’t sure what to try next, come give us a visit in office hours!

Contributor(s): Kaki Ryan