Tribulations of a data researcher

I am Victor Rodriguez and I am a data research scientist. As of October 2012, I have been obliged to blog. But I appreciate this because we are always eager to talk about ourselves (even if it's only about our work) and we usually lack a good excuse. I have to blog to get paid, and this is an excellent excuse to write.

Nobody is obliged to read, though. However, you might want to read this blog if...

.: 

Bits and RDF Triples

Information theory describes messages as the information passed from a source to a receiver.
The Linked Data model is about connecting related data using the Web.
The information unit in the first model is the bit, the connection unit in the second is the RDF Triple linking two resources.

.: 

Bits and RDF Triples: Access control and link control.

In a previous post, I described a parallelism between the information theory paradigm and the Linked Data paradigm.

Information Theory development led to the study of other problems, like the channel capacity in a communication, the information encoding and data compression techniques or the access control. Access control decides whether a receiver should obtain certain information if certain conditions hold. What about RDF links?

.: 

Distance metrics between Tweets

A numeric distance can be calculated between two Tweets to represents how near they are in meaning, or at least a vague approximation.

The first approach considers the 1-gram and 2-gram and their relative frequencies, as offered in the www.ngrams.info webpage for the American English language. Try the algorithm here!0

.: 

Sample Twitter datasets

Twitter is not happy to see their datasets online. There were some excellent datasets, like this one with 470 million tweets, but they had to go offline. As for now, these pages here in salonica are for private use of colleagues and will not be made public.
We have considered the following two datasets:

.: 

The Work of Art in the Age of Mechanical Reproduction

For the last 6 years I have been researching on how to license content in order to foster the e-commerce of virtual goods.
It was a blunder. Content and data availability has been steadily growing and its value steadily decaying. There is no commerce of abundant goods as they are no longer economic goods. Our world is getting poorer in material resources and richer in information.

Tags:

.: 

Luditas digitales

Comentario ludita

Este año se cumplen 200 años desde que estallaran los disturbios luditas.
Este movimiento entendía la máquina como enemiga del trabaja y como tal había de ser destrozada.

.: 

Estudio Matemático de la Quiniela

El "Estudio Matemático de la Quiniela" es un libro que describe un método de juego óptimo para apostar en la quiniela. Está basado en la rentabilidad matemática de las columnas, detallando cómo calcularla y con qué confianza cabe esperar el retorno de inversión en qué plazo.

.: 

Mindstorms Mobile Webcam

As soon as I acquired a Lego Mindstorms set, I built my first project not-in-legoinstructions.

Mindstorms Mobile Webcam

A computer controlled webcam which can be controlled from a computer via Bluetooth.

This is a normal webcam, attached to a computer, but which can be moved by two engines in the two degrees of freedom, covering all the possible range. It is a very simple model.

Tags:

.: 

Método de juego a la quiniela

"As of April 2007, I published the software I used together with a short description of the method.
The software had been written during 2005, though...

Esta página presenta un método para jugar en la quiniela española
de fútbol con criterios matemáticos. Ésta podría
ser una descripción de las reglas.

Tags:

.: 

Pages

Subscribe to www.cosasbuenas.es RSS