A few of us recently submitted a paper to ALASI’2017 that examined a “case study” of a teacher (me) engaging in a bit of DIY learning analytics. The case was used to drawing a few tentative conclusions and questions around the institutional implementation of learning analytics. The main conclusion is that teacher DIY learning analytics is largely ignored at the institutional level and that there appears to be a need and value to support it. The question is how (and then if supported, what happens)?
This post is the start of an exploration of some technologies that combined may offer some of the affordances necessary to supporting teacher DIY learning analytics. The collection of technologies and the approach owes a significant amount of inspiration to Tony Hirst, especially in this post in which he writes
What I care about are some of the features that Docker has, and how I can use those features to make my own life easier, … supporting personal, DIY, BYOA (“bring your own app”) IT that works at an individual level in the form of end-user applications, or personal digital workbenches
The plan/hope here is that Docker combined with some other technologies can provide a platform to enable a useful combination of do-it-with (DIW) and do-it-yourself (DIY) paths for the institutional implementation of learning analytics. The follow is mostly documenting ad hoc exploration of the technologies.
In the end, I’ve been able to get working a Jupyter notebook working as a JSON API and started explorer docker containers. Laid the ground work for the next step which will be to explore how and if some of this can be combined to integrate some of the work Hazel is doing with some of the Indicators work from earlier in the year.
Learning more – Juypter notebook JSON api
The README.md from github repo mentions serving HTTP requests from “annotated notebook cells”. Suggesting that the method of annotation will be important. The IBM example code that each API call is handled by a particular block starting with an appropriately formatted comment i.e.
single-line comments containing a HTTP verb … followed by a parameterised URL path
Have a simple example working.
Deploying – user experience
The IBM bit then goes about using Docker to to deploy this API. But before I do that. Lets get some experience at the user en with Tony’s example.
- Install VirtualBox
Question: Is this something a standard user can do?
- Install vagrant
- command line to install a vagrant plugin
Question: Too much? But can probably be worked around.
- Download the repo as a zip file.
Had to figure out to go back to the repo “home” to get the download option (long time between drinks doing this).
- Run the vagrant file
Ok, it’s downloading the file from the vagrant server (from the ouseful area on Vagrant).
It’s a 1.66Gb file. That size could potentially be an issue, suggesting the need for a local copy. Especially given the slow download.
An hour or two later and it is up and running. There’s a GUI linux box running on my Mac.
Don’t know a great deal about the application that is the focus, but it appears to work. It’s a 3D application, so the screen refresh isn’t all that fast. But as a personal server for DIY teacher analytics, it should work fine, at least in terms of speed.
Running it a second time includes a check to see if it’s up to date and then up it pops.
The box appears to have Perl, Python and Juypter installed.
Deploying – developing a docker/container/images
This raises the question of the best option for creating and sharing a docker/container/insert appropriate term – I’ll go with images – that has Jupyter notebooks and the kernel_gateway tool running. At this stage, this purpose seems best served by a headless virtual machine with browser-based communication the method for interacting with Jupyter notebooks.
Tony appears to do exactly this (using OpenRefine) using Kitematic in this post. Later in the post the options appear to include
- Sharing images publicly via the Dockerhub registry
- Use a private Dockerhub registry (one with the free plan)
- On a local computer
- Run your own image registry
- And, I assume use an alternative.
Tony sees using the command line a draw back for running your own. Perhaps not the biggest problem in my case. But what is the best approach?
Dockerhub and its ilk do appear to provide extra help (e.g. official repositories you can build upon).
One set of alternatives appear largely focused on supporting central IT, not the end user. Echoing a concern expressed by Tony.
Intro from another alternative suggests that docker is becoming more generic. Time to look and read further afield.
Intro to containers
- Containers abstract the OS etc to make it simple to deploy
- Containers usually measured in 10s of megabytes
- Big distinction made between containers and virtual machines, perhaps boils down to “containers virtualise the OS; virtual machines the hardware”
Though interesting, the one tried above required the downloading of a virtual machine first. Update: That appears to be because I’m running Mac OS X. If I were on a Linux box, I probably wouldn’t have needed that.
- The following seem to resonate most with the needs of teacher DIY learning analytics
- Using containers can decrease the time needed for development, testing, and deployment of applications and services.
- Testing and bug tracking also become less complicated since you there is no difference between running your application locally, on a test server, or in production.
- Container-based virtualization are a great option for microservices, DevOps, and continuous deployment.
- Docker is based on Linux and open source, is the big player.
- Spends some attention on container orchestration – appears to be focused on enterprise IT.
Following offers a creative intro to Kubernetes
Starts with the case for containers (Docker), but then moves onto orchestration and the need for Kubernetes. Puts containers into a pod, perhaps with more than one if tightly coupled. Goes onto to explain the other features provided by Kubernetes.
And intro to Docker
Rolling my own
Possible technology options
- Docker toolbox
Though that appears to be deprecated
- Docker for Mac (and a version for Windows)
Download it; test it…all works
Do the following and I have a web server running in Docker that I can access from my Mac OS browser.
AA17-00936:docker david$ docker run -d -p 80:80 --name webserver nginx Unable to find image 'nginx:latest' locally latest: Pulling from library/nginx afeb2bfd31c0: Pull complete 7ff5d10493db: Pull complete d2562f1ae1d0: Pull complete Digest: sha256:af32e714a9cc3157157374e68c818b05ebe9e0737aac06b55a09da374209a8f9 Status: Downloaded newer image for nginx:latest f1f6925acc31f80faf726358f8de5712458ff3649d2c0626bf3bb37f11d1b070 AA17-00936:docker david$
Dig into tutorials and have a play
Docker share a git repo for tutorials and labs. Which are quite good and useful.
Getting set up with some advice above.
Running your first container includes some simple commands. e.g. to show details of installed images. Showing that they can be quite small.
Question: To have folk install Docker, or do the VM route as above?
AA17-00936:docker david$ docker images REPOSITORY TAG IMAGE ID CREATED SIZE ubuntu latest 2d696327ab2e 11 days ago 122MB nginx latest da5939581ac8 2 weeks ago 108MB alpine latest 76da55c8019d 2 weeks ago 3.97MB hello-world latest 05a3bd381fc2 2 weeks ago 1.84kB
Web apps with docker, which also starts looking at the process of rolling your own.
This is where discussion of different types of images commence
- Base (e.g. an OS) and child images which add functionality to a base image
- Official images – sactioned by docker
- user images
Process can be summarised as
- Create the app (example is using a Python web framework – Flask)
- Add in a Dockerfile – text file of commands for the Docker daemon when creating an image
- Build the image
Does require an account on the Docker cloud
And there it goes getting all the pre-reqs etc. Quite quick.
And successful running.
Docker Swarm running multiple copies, including on the cloud. Given the use case I’m interested in is people running their own…not a priority.
It does provide a look at Docker Compose files and a more complex application – multiple containers and two networks. Given my focus on using Jupyter Notebooks and perhaps the kernel gateway, this may be simplified a bit.
Seems we’re at the stage of actually trying to do something real.
Create a Docker image – TDIY
Jupyter Notebook, kernel gateway and a simple collection of notebooks – perhaps with greasemonkey script
Misc. related stuff
Bit on microservices (microservice architectural style) pointing out the focus on
principles of loose coupling and high cohesion of services
and in turn a number of characteristics
- Applications are made up of small independent services
Is TDIY LA about allowing teachers to create applications by combining these services?
- Services are independently modifiable and (re)deployable
But by whom?
- Decentalised data management: each service can have its own database
What about each user?
Goes on to list a range of advantages, but the disadvantages include
- inefficiency – remote calls, network latency, potential duplication etc.
But going local might help address some of this.
- Developing a user case could need the cooperation of multiple teams
This is the biggest barrier to implementation within an instituiton. But raises the spectre of shadow systems, kludges etc.
- complications in debugging, communication
Microservices and containers covers some of the alternatives.
Seems docker is the place — it’s bought Kitematic and apparently not loved it – a risk for basing the DIY approach on it.
Another part of the story is that you can build your own images and either share them publicly via the Dockerhub registry, keep them locally on your own computer, post them to a private Dockerhub repository (you get a single private repository as part of the Dockerhub free plan, or can pay for more…), or run your own image registry.
Dockerhub is probably the option I want to use here because of the focus on being open, of being cross institutional etc.