Let’s face it, data scientists work with software engineers and at the end of the day, they have to deploy software. An effective data scientists should be able to work within a software development team, so (s)he must be able to use the tools of the trade. Our previous post introduced the very basics and now we make a further step.
(more…)Author: crowintelligenceteam
Using AI to Predict Risk in Public Procurement
Our report on using AI to predict the riskiness of public procurement calls based on their text is available on tenders.guru
(more…)Software engineering for data scientists – Part 1, Development Tools
I have bad news; being able to write programs is only a tiny fraction of software development and data science. You have to know a lot of things and this can become very frustrating. You will be bombarded with acronyms and silly-named tools, like IDE, git, version control, CI, etc, etc. At some point, you have to start use the tools of the trade along with coding and here we give you some advice on what to use to become a pro.
(more…)A simple Markov chain text generator in Python
What does it mean to build a language model? How it is related to the Beat Generation’s cut-up technique and can be used for text generation? We’ll shed light on these questions using Python!
(more…)Order emerging from randomness or the joy of random Boolean networks and Python
Boolean networks are widely used in computational biology to model gene regulatory networks, and they can be very useful in time-series analysis too. Here, we will use the concept of random Boolean networks to generate interesting images which show some sort of repetitive patterns. Let complexity meet generative art and Python!
(more…)Getting Started with Colab Using TensorFlow is freely available on Manning LP
We developed a free course for Manning liveProject. Getting Started with Google Colab Using TensorFlow has got a very descriptive title, so it teaches you the very basics of using Colab with TensorFlow.
(more…)Getting Started with Colab Using PyTorch is freely available on Manning LP
We developed a free course for Manning liveProject. Getting Started with Google Colab Using PyTorch has got a very descriptive title, so it teaches you the very basics of using Colab with PyTorch.
(more…)Getting Started with Jupyter is freely available on Manning liveProject
We’ve been working hard on our newest liveProject, Getting Started with Jupyter Notebook during the last few months and now it is freely available!
(more…)Content Analysis with Similarity Graphs – the talk is available on YouTube
We presented our thoughts on content analysis with similarity graphs at the Graph Data Science Conference. The full conference is available on Twitch, and Orsolya’s talk is now available on our YouTube channel. You can watch it below!
(more…)A Diary on Information Theory – a lightweight yet rigorous intro into the field
Information Theory (IT) was born in 1948 when Claude Shannon published its a Mathematical Theory of Communication. This seminal work uses linguistic examples, so Information Theory is related to language since its inception. There were many attempts to explain IT to the general audience, but Rényi’s Diary is outstanding with its unique style, clear prose and rigor. If you read only one book on IT, read this one!
(more…)