I am one of those who often need to flip between Excel or R depending on the needs of the client. Calling Excel in R is not that trivial so here are some notes on how to do that. So let’s say that you have an Excel VBA program that you want to take over after R has done its job.
You use the system2 command in R like so
> system2(“/Applications/Microsoft\ Excel.app/Contents/MacOS/Microsoft\ Excel”, c(“/Users/Extranosky/Temp/VehicleRepair.xlsm”))
The second argument should not have spaces in its file or directory name. I have not had success in passing a file name to be opened by Excel in this platform that had spaces in its directory.
This command will call up Excel open up the file as an argument and hopefully you got an open event in that VBA so things can be done seamlessly and automatically.
In this post I will show an example of how mathematics can be very mysterious. I showed this to what might be classified as “first year in maths” students and they came out of class perplexed, as if their mind had to hit the reset button and reboot.
Theorem: Let be the set of natural numbers and the set of integers. We have and .
This means that though is a subset of , the size of is equal to the size of . In technical language the cardinality of the natural numbers is equal to the cardinality of the integers, What duh?
a.) , this is trivial because the natural numbers are just the positive integers in . So every element of is found in .
b.) In order to show that the size of the two sets are equal we need to establish a bijective function from one set to the other. That is a function which is both surjective and injective. Another way of saying this is to say that we need a function that is onto and at the same time one-to-one from one set to the other. Obtaining such a function proves that the size of both sets are the same.
We will just get the one suggested by wikipaedia: We let
with when and only when and ,
else when and only when .
1.) is one-to-one, i.e. injective. Let , then we have two cases, either or . The first case we have , . On the other hand, if the second is the case then again this .
2.) is onto, i.e. surjective. Let then is positive and either even or odd. If even and positive, then in general for some integer ,(the property of even numbers). Since is an integer, then it is an integer in and so that and choose . On the other hand, if is positive and odd, then for some integer (property of odd numbers) and has to be positive since is positive, i.e., and so we have . So in both cases we have seen that for every , we have found a matching .
How can a subset have the same size as it's superset? If this does not boggle your mind, perhaps you missed the point. The reason for this is that we have here two infinite sets, and this mystery only happens when infinity is involved. Now some mathematicians are not happy with this that is why they do not believe in infinite sets. It seems infinity is just a concept that has no matching physical reality and we can be indifferent with it. I suggest the concept of infinity is a metaphysical concept. So can one can reason that it is a concept that exists just in our mind and is not "real". Just like unicorns or fairies? I do not think so. There is no reason for us to believe in unicorns and in fact not all civilisations believe in the mythical horse. However the concept of infinity is different. It is because the concept per se is a necessity. The mind requires it when presented with the nature of numbers. For is it not true that the set of natural numbers is infinite? We can conceive it and by force of nature admit it. It is a necessary truth so in that sense infinite sets are real and transcends material physicality.
Have you ever encountered mathematicians who do not believe in “real numbers”? Well there are some, mainly those who come form a computer science ideology. I am starting to understand why they do not think real numbers are real or useful as a concept. Firstly what do we mean by a real number? It comes from looking at the number line as a continuum. It is treating the number line consisting of infinite number of points. For example the numbers between 0 and 1 – there are infinite “real numbers” there.
Take an example of a so called real number . It is written as 3.141592653589793… Now notice the ellipses in the number. They are there to say that the decimals after the last 3 as printed is infinitely long. So people think that a real number can be represented by those dot dot dot and so the real numbers have unending decimal series. In actuality the symbol is the limit of that series of decimals once considered.
Now I can appreciate why A/Prof. N. Wildberger insists the need for something to be written down, and we will explain the reason why later. If you think for a moment, we are not capable of writing a real number down. Those dot dot dots are a semantic idea to signify to us that the digits following goes into an infinite series. That is not really writing a number down. Why do we need to be able to write something down with finality?
Well it is because we can put the process of writing into an algorithm. We can put it into a function. So imagine again . The fact that we can not write the number down with completion means we can not put that generation of the numbers into an algorithm that will stop. It can not and we won’t let it stop precisely because the number of decimals in the tail end is infinite. So an algorithm that goes into an infinite loop, making it useless. Since we can not locate precisely where is in the continuum line we can not even have a function to compute it. In a sense the digits following are not decidable.
From a computer science point of view, the algorithm must terminate and if it does not, then the function is undefined at that point. The problem stems from the idea of infinity of points present in the number line. Yet in practice we can not really even locate real numbers in the number line. You can have the function stop at the 100th position of but that is not itself. This is the best we can do but that number is not exactly rather it is “something like or close to “.
So real numbers are unreal, man.
This year I began doing private maths tutoring and I have been learning a lot about the deficiencies in mathematics education that are encountered by our high school students. I am very skeptical about this “new maths” approach. For one thing, the students are not taught to use pen and paper to write out their reasoning and calculation. For another, they make the student rely heavily on intuition. Sometimes intuition helps but other times, intuition can mislead.
Let me illustrate this problem, not original to me.
Assume we have 3 cards with two faces. One card is colored black on both sides, the other is colored white on either side, and the last has black on one side and white on the other. Let us drop the cards in a hat and then choose a card, and then when we get a card, we choose a side to see at random too. Question: If the side we see is black, what is the probability the other side is black also? Did you answer 1/2? Your intuition has misled you. You probably thought, by this data we can dismiss the possibility of the card with both white colors (the second card definition) and just deal with the first and last card. This is not the true situation.
Here is our analysis. The sample space describes the possible color combination of our cards, e.g. BB means one side is black and the other side is black also, etc. Let “the side we see is black”, “the other side is black”.
So the situation is asking what is ?
actually has 3 ways of getting a black out of 6 ways of getting a face. Then also is tantamount to getting the first card in our description which is 1 out of 3.
So the moral of the story is that intuition can not be a substitute for formalism. Formalism actually yields a more accurate result. Our intuition is trumped by the formal analysis, which is a better way of approaching the problem.
It is a common question asked in data science or data analysis forums if one should use Python or R one’s data work. So far, I myself have managed not to learn Python. I have managed to ward off the urge to do so. Now I have learned plenty of programming languages and have actual work experience in the following: C,C++,Java,Perl, PHP, Tcl, VBA. If I look further back, I should mention COBOL, Fortran, Assembler, Algol and BPL – ancient Burroughs programming language based on Algol hence, BPL. In fact, I should name Scheme/Lisp and Ocaml (see older posts) as one of the languages I can code and program. Currently, I am playing around Clojure . I can really learn Python if I wanted to. However, I don’t.
Oh please, not another language to learn!
Why? Because for statistical type of work R is enough, yes, I can even use R for data cleansing and munging, where Python could probably help. However for that type of task, R has so many functions I can avail of without touching Python. Anyway, which one is close to statistics? Python or R? It is R and if I want to do any general purpose computation I can do it all in R because of those functions. Lastly, the nice thing is that R takes some of those functional programming insights into its philosophy, it took its inspiration from Scheme.
You can get a copy of this textbook here.
Your interest and comments will be most welcomed.
I am working to release it at Amazon, CreatSpace and in Kindle.
A few days ago, I went to a seminar conducted by one of my former professors on the Internet of Things and I learned how we now have plenty of sensors which can publish data into the Internet. Name it what you will, it can be traffic cameras, or weather stations etc, they can all tap into the Net.
During the seminar, my mind wondered off and I started imagining the movie Terminator. The reality of Skynet may no longer be confined to the movie franchise. Then today I stumbled on this article of Elon Musk.
Read it and let me know what you think.