Adding two columns using Deedle in C#

Given the following CSV file
A,B
2,3
5,7
9,11
I'd like to add the two columns, resulting in
A,B,C
2,3,5
5,7,12
9,11,20
using C# and Deedle.
using Deedle;
using System.IO;
using System.Linq;
namespace NS
{
    class AddTwoColumns
    {
        static void Main(string[] args)
        {
            var root = "path/to";
            var df = Frame.ReadCsv(Path.Combine(root, "data.csv"));
            var a = df.GetColumn<int>("A");
            var b = df.GetColumn<int>("B");
            var c = df.Select(x => x.a + x.b);
            df.AddColumn("C", c);
            df.Print();
        }
    }
}
Neither the reference nor the tutorial (series, frame) is particularly illuminating.
What is the correct df.Select() for this simple operation?

a and b are just Deedle.Series objects, which support numerical operations. So you can do this simply by adding the two series:
// simply add the series
var c = a + b;
df.AddColumn("C", c);
df.Print();
// output
     A B  C
0 -> 2 3  5
1 -> 5 7  12
2 -> 9 11 20
The Statistics and calculations section (of the page you linked to) briefly mentions arithmetic operations. It also features a note on missing data which you might need to consider:
Point-wise and scalar operators automatically propagate missing data.
When calculating s1 + s2 and one of the series does not contain data
for a key k, then the resulting series will not contain data for k.

I know this question is specifically about C#, but I hope this F# approach can help somehow:
Frame.ReadCsv(@"C:\Users\flavi\Downloads\sample.txt")
|> fun frame ->
    Frame.addCol "C"
        (Frame.mapRowValues (fun row ->
            row.GetAs<int>("A") + row.GetAs<int>("B")
        ) frame) frame

Related

How to do a row based process using Deedle (Frame in and Frame out)

I am trying to use Deedle to do row-based processing on a DataFrame, but I just can't get my mind tuned into the Deedle way.
Say for a Frame like
Indicator1 Indicator2
1 100 200
2 300 500
3 -200 1000
Say some rules need to be applied to each indicator:
if the indicator value is less than 500 and greater than 0, multiply it by 1.1
if the indicator value is less than 0, make it NaN
I have been trying to use Frame.mapRow .... functions.
I know I can use the
fun v -> let indVal = v.GetAs<Int>("Indicator1");
let newIndVal = match indVal with
|...... logic
|...... some other logic
let indVal2 = v.GetAs<Int>("Indicator2");
let newIndVal2 = match indVal2 with
|...... logic
|...... some other logic
with the Frame.mapRow ....
But I am stuck on how to turn newIndVal and newIndVal2 back into a row, and eventually back into a new data frame.
What I am trying to achieve is frame in, frame out. Also, I only know how to process the columns one by one (after retrieving them by index or name). If the logic to be applied is generic, is there a way NOT to apply it column by column?
An imperative (and really simple) way to do this with a C or C# 2D array is:
loop through the row dimension
loop through the column dimension
apply the rule as the side effect to the array[row,col]
How to achieve this in Deedle?
UPDATE:
Leaf Garland's suggestion works great if the calculation doesn't need to reference other columns from the same row. For my case, I need to look at the data row by row, hence I would like to use Frame.mapRows. I should have been clear on the simplified requirements:
Say for a Frame like
Indicator1 Indicator2
1 100 200
2 <Missing> 500
3 -200 1000
4 100 <Missing>
5 <Missing> 500
6 -200 100
For example
if indicator1 is less than 300, new Indicator2 value is Indicator2 + 5% * Indicator1
I need to use
mapRows fun k v -> let var1 = v.get("Indicator1")
let var2 = v.get("Indicator2")
run through the conditions and produce new var1 and var2
produce a objectSeries
|> Frame.ofRows
The pseudocode above sounds simple, but I just can't figure out how to produce a proper ObjectSeries to recreate the Frame.
I also noticed something I can't explain with the mapRows function [SO question]: Deedle Frame.mapRows how to properly use it and how to construct objectseries properly
Update
Since the original question was posted, I have used Deedle in C#. To my surprise, row-based calculation is very easy in C#, and the way the C# Frame.Rows property handles missing values is very different from the F# mapRows function. The following is a very basic example I used to try out and verify the logic; it might be useful to anyone searching for a similar application:
Things to pay attention to are:
1. Rows did not drop the row in which both columns' values are missing.
2. The Mean function is smart enough to calculate the mean from the available data points.
using System;
using System.Collections.Generic;
using System.Text;
using System.Threading.Tasks;
using Deedle;
namespace TestDeedleRowProcessWithMissingValues
{
    class Program
    {
        static void Main(string[] args)
        {
            // s1 has one missing value (NaN) two days ago.
            var s1 = new SeriesBuilder<DateTime, double>(){
                {DateTime.Today.Date.AddDays(-5),10.0},
                {DateTime.Today.Date.AddDays(-4),9.0},
                {DateTime.Today.Date.AddDays(-3),8.0},
                {DateTime.Today.Date.AddDays(-2),double.NaN},
                {DateTime.Today.Date.AddDays(-1),6.0},
                {DateTime.Today.Date.AddDays(-0),5.0}
            }.Series;
            // s2 has two missing values and no entry at all for today.
            var s2 = new SeriesBuilder<DateTime, double>(){
                {DateTime.Today.Date.AddDays(-5),10.0},
                {DateTime.Today.Date.AddDays(-4),double.NaN},
                {DateTime.Today.Date.AddDays(-3),8.0},
                {DateTime.Today.Date.AddDays(-2),double.NaN},
                {DateTime.Today.Date.AddDays(-1),6.0}
            }.Series;
            var f = Frame.FromColumns(new KeyValuePair<string, Series<DateTime, double>>[] {
                KeyValue.Create("s1",s1),
                KeyValue.Create("s2",s2)
            });
            s1.Print();
            f.Print();
            f.Rows.Select(kvp => kvp.Value).Print();
            // 29/05/2015 12:00:00 AM -> series [ s1 => 10; s2 => 10]
            // 30/05/2015 12:00:00 AM -> series [ s1 => 9; s2 => <missing>]
            // 31/05/2015 12:00:00 AM -> series [ s1 => 8; s2 => 8]
            // 1/06/2015 12:00:00 AM -> series [ s1 => <missing>; s2 => <missing>]
            // 2/06/2015 12:00:00 AM -> series [ s1 => 6; s2 => 6]
            // 3/06/2015 12:00:00 AM -> series [ s1 => 5; s2 => <missing>]
            f.Rows.Select(kvp => kvp.Value.As<double>().Mean()).Print();
            // 29/05/2015 12:00:00 AM -> 10
            // 30/05/2015 12:00:00 AM -> 9
            // 31/05/2015 12:00:00 AM -> 8
            // 1/06/2015 12:00:00 AM -> <missing>
            // 2/06/2015 12:00:00 AM -> 6
            // 3/06/2015 12:00:00 AM -> 5
            //Console.ReadLine();
        }
    }
}
You could map over all values in your frame using Frame.mapValues. Provide it a function that takes your data type and returns the updated value.
let indicator1 = [100.0;300.0;-200.0] |> Series.ofValues
let indicator2 = [200.0;500.0;1000.0] |> Series.ofValues
let frame = Frame.ofColumns ["indicator1" => indicator1; "indicator2" => indicator2]
// val frame : Frame<int,string> =
//
// indicator1 indicator2
// 0 -> 100 200
// 1 -> 300 500
// 2 -> -200 1000
let update v =
    match v with
    | v when v < 500.0 && v > 0.0 -> v * 1.1
    | v when v < 0.0 -> nan
    | v -> v
let newFrame = frame |> Frame.mapValues update
// val newFrame : Frame<int,string> =
//
// indicator1 indicator2
// 0 -> 110 220
// 1 -> 330 500
// 2 -> <missing> 1000

New mechanism of ordering

I am trying to reinvent a bicycle that has surely been invented already :)
So, I have a sorted dictionary:
1:val1,
2:val3,
3:val3
Now I need to insert a new item between keys 1 and 2 without changing the existing keys (2 and 3). One way is to insert a key with the value 1.5, but the question is how many items I can insert between 1 and 2 this way, and which algorithm is suitable for this.
It should be possible to insert N items between them.
The data type or format of the key does not matter; the crucial thing is that existing keys do not change.
You can use alphabet as keys. New key will be concatenated string of previous and next key.
For example:
If you have A and B, new key will be AB:
A
AB (A + B)
B
to add new key between A and AB, you will have AAB:
A
AAB (A + AB)
AB (A + B)
B
and so on...
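A minimal Python sketch of this concatenation idea, reproducing the A / AAB / AB / B example above. The helper name key_between is hypothetical and not part of the answer. One caveat is worth stating: plain concatenation is not guaranteed to sort between two arbitrary neighbours (for "A" and "AA" it yields "AAA", which sorts after "AA"), so the sketch checks the result before returning it.

```python
def key_between(prev_key: str, next_key: str) -> str:
    """Hypothetical helper: propose a new key by concatenating the two neighbours."""
    candidate = prev_key + next_key
    # Concatenation does not always land between the neighbours, so verify it.
    if not (prev_key < candidate < next_key):
        raise ValueError(
            f"{candidate!r} does not sort between {prev_key!r} and {next_key!r}"
        )
    return candidate


keys = ["A", "B"]
keys.insert(1, key_between("A", "B"))   # adds "AB" between A and B
keys.insert(1, key_between("A", "AB"))  # adds "AAB" between A and AB
print(keys)  # ['A', 'AAB', 'AB', 'B']
```

The existing keys are never rewritten; only the new key is computed, at the cost of keys that grow longer with every insertion.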
You can use BigDecimal in Java to get new_key=(key1+key2)/2.
import java.math.BigDecimal;
public class CodeSheet {
    public static void main(String[] args) {
        BigDecimal a = new BigDecimal(1);
        BigDecimal two = new BigDecimal(2);
        for (int i = 0; i <= 6000; i++)
        {
            a = a.divide(two);
            if (i % 1000 == 0)
                System.out.println(a.toString());
        }
    }
}
output:
0.5
4.666318092516094394950447723619085848085457231858540123108571698979834554878878172272201635489405511797974949651621213121077437606770161974207604086019653781172053330691625751369975379929509157555502453981325565591202562573979668954025891355627075519053491894272132405597349071143304796110088314552213992280847244435737332640031641842263237146309149310826013965976447468035589253318343705327199027653590681602999224130209770506066148149347510972573049521073043341806223964760174134323088289634580237100329681945208689479110591825390227783142221369626937585639273983907781732018574388408834499276960346327197120043559868373508749313133453736483812679019646881169169905234639372793026268482208251953125E-302
4.354904908108608337788097747389436147929551871352694308321746614749144426703133706892369375399893940532782043588727560318102151185994168162541395412261518430550755321155148659223854561694954711924029281870986173595315518548585511706690334120707350364118146386757419736828045574038868295421914635929079059744555673086284578352082725382878484893441482333124196570054633787834328204149323035102391048682919453726835836055542393496585133335537308533180937973567503195969258360476033334393391821525284565813777886412721596101172364149931162673561491299813436308231347105575294992080347939004544602896748346183684569136737241491985864479016388335577898767470263560575437108148985779824881400524519876725232838928477267134127170926478227142121467310560266975687300142692457732481335134494432171071782408627253792130227205541778953790826318813538185488987468045762324286746160892737356344542977960827017103483581102720485435212671525873919071359975592227058010595009966338494815791830176096729490346274067433317969857326763618026109468018115811724815116133999445116665877671509729075069315638648723850028991409064960833766332205861977229046767218582849546541487471215035845260620260257995532242830003020146307439405502318054114708229860672592763223579453418947532879305088491927160146050190013971612697710103760111069333206902362883772710769598910416379683242622746115746412698399581131525337696075439453125E-603




So it can hold at least 6000 records, even in the unlikely worst case where each new key is always half of the current minimum key.
Java's BigDecimal implementation has some flaws; you can implement your own if necessary.
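The same midpoint rule, new_key = (key1 + key2) / 2, can be sketched in Python with exact rationals instead of BigDecimal. This is an illustration under assumptions, not the answer's code: the function name midpoint_key and the sample keys 1, 2, 3 (taken from the question) are chosen for the example.

```python
from fractions import Fraction


def midpoint_key(low: Fraction, high: Fraction) -> Fraction:
    """Return the exact midpoint (low + high) / 2 as a rational number."""
    return (low + high) / 2


keys = [Fraction(1), Fraction(2), Fraction(3)]
# Repeatedly insert right after key 1; the existing keys 2 and 3 never change.
for _ in range(5):
    keys.insert(1, midpoint_key(keys[0], keys[1]))
print(keys[:3])  # [Fraction(1, 1), Fraction(33, 32), Fraction(17, 16)]
```

Because the arithmetic is exact, two distinct neighbours always yield a new distinct key; the trade-off is that the denominator doubles each time the same gap is split again.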

Import F# block of code into C# application [duplicate]

This question already has answers here:
Run F# code in C# - (Like C# in C# using Roslyn)
(3 answers)
Closed 9 years ago.
I was looking for a solution to import an F# block of code into a C# application (in order to speed up some calculations I have). So I installed FSharpx.Core into my C# solution in MS Visual Studio Express 2013.
And that's it... I don't know how to continue :-)
Can someone help me with some simple code, e.g. to subtract variables v1 and v2 when I click a button in the form?
Form1.cs
...
private void buttonCalcVars_Click(object sender, EventArgs e)
{
int sum = CallFSharpFunction(1,2);
}
// This should be some F# block of code
private int CallFSharpFunction(int a, int b)
{
let v1 = a;
let v2 = b;
// do some calculations
return result;
}
...
Hope this is possible,
Thank you!
Before using F#, one wonders whether an F# function can be used in applications written in C#.
You can get acquainted with the functional language F#, for example, at this address: http://msdn.microsoft.com/ru-ru/magazine/cc164244.aspx
In my opinion, the functional language F# is good for writing various mathematical functions (excuse the tautology), but why take the bread away from object-oriented languages, and why write hard-to-read code for working with WinForms controls or WebForms pages in F#? So I immediately wondered how to call a function from an F# assembly. I should say right away that, since typing already causes difficulties when writing code in functional languages, these difficulties only increase when F# functions are used from C# assemblies. Let's get down to business.
Create a solution that includes, for example, a C# console application and an F# assembly.
In the F# assembly we need one file, MyFunctions.fs. In it we define some functions that we find easier to write in a functional language. For example, let it be a function that converts an array of bitmap image pixels from RGB to the YCbCr color space (this is just an example). In F# it can look roughly like this:
open System
// Convert one RGB triple to YCbCr.
let RGB_to_YCbCr (r : double, g : double, b : double) =
    let y = 0.299 * r + 0.587 * g + 0.114 * b
    let Cb = (-0.1687) * r - 0.3313 * g + 0.5 * b + 128.0
    let Cr = 0.5 * r - 0.4187 * g - 0.0813 * b + 128.0
    (y, Cb, Cr)
// Same conversion for a pixel given as a three-element array.
let RGB_to_YCbCr_v (v : _ array) =
    RGB_to_YCbCr (v.[0], v.[1], v.[2])
// Convert a whole image (an array of pixels).
let Process (image : _ array) =
    let length = Array.length image
    let imageYCbCr = Array.create length (0.0, 0.0, 0.0)
    for index = 0 to length - 1 do
        imageYCbCr.[index] <- RGB_to_YCbCr_v (image.[index])
    imageYCbCr
After building, we can see that the functions end up under an odd-looking namespace, and it is not clear how to use them. We can inspect where the functions are located in the assembly using Reflector.
To specify the namespace and the class, add the following line immediately after #light:
module FSharp.Sample.MyFunctions
This says that all the functions written below will be contained in the class MyFunctions in the namespace FSharp.Sample.
After rebuilding the project, we will see that the assembly has a clear namespace FSharp.Sample containing the class MyFunctions, with the functions described above as its static methods.
Next, in our console application, we add a reference to the assembly FSharp.Sample (the name of my F# assembly) and to FSharp.Core, in order to use F# types (classes) such as Tuple. Then we write the following code:
using System;
using FSharp.Sample;
namespace CSharp.Sample.Console
{
    class Program
    {
        static void Main()
        {
            double[][] image = new double[1000][];
            Random rand = new Random();
            for (int i = 0; i < 1000; i++)
            {
                image[i] = new double[3];
                image[i][0] = rand.Next();
                image[i][1] = rand.Next();
                image[i][2] = rand.Next();
            }
            foreach (var doubles in MyFunctions.Process(image))
            {
                System.Console.WriteLine(doubles);
            }
        }
    }
}
First we specify that we use the FSharp.Sample namespace. In the code we generate an array of data and pass it to the function MyFunctions.Process, which converts it with the chosen algorithm. The result comes back as an array of values of type "Microsoft.FSharp.Core.Tuple`3".

how to join two Lists in linq

I have two lists, A and B, of integer values. List A contains 40 down to 1 and list B contains 40 to 79. I would like to add the corresponding values (except 40) and store the results in another list using LINQ, so that the resulting list looks like {80, 80, ...}. How can I do this? Is it possible?
It sounds like you're trying to "join" these in a pairwise fashion by index: the first element from each list, then the second element etc. That suggests you want Zip, which was introduced in .NET 4:
var zipped = list1.Zip(list2, (x1, x2) => x1 + x2);
If you're using .NET 3.5, you can use a separate implementation of the same method, such as the one in MoreLINQ.
EDIT: Alternatively, Eric Lippert posted some source code for Zip a while ago, too.
Check out the IEnumerable<T>.Join() method.
using System;
using System.Linq;
class Program
{
    static void Main()
    {
        // Two source arrays.
        var array1 = new int[] { 1, 2, 3, 4, 5 };
        var array2 = new int[] { 6, 7, 8, 9, 10 };
        // Add elements at each position together.
        var zip = array1.Zip(array2, (a, b) => (a + b));
        // Look at results.
        foreach (var value in zip)
        {
            Console.WriteLine(value);
        }
    }
}
--- Output of the program ---
7
9
11
13
15
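For comparison, here is the same pairwise-by-index sum applied to the lists from the question, sketched in Python rather than C#. It assumes "except 40" means skipping the leading 40 in each list; the variable names are made up for the example.

```python
list_a = list(range(40, 0, -1))  # 40, 39, ..., 1
list_b = list(range(40, 80))     # 40, 41, ..., 79

# Skip the first element (40) of each list, then add the values pairwise by index.
sums = [a + b for a, b in zip(list_a[1:], list_b[1:])]
print(sums[:3], len(sums))  # [80, 80, 80] 39
```

Like LINQ's Zip, Python's zip stops at the shorter input, so lists of unequal length do not raise an error.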
Try Joining them together
http://weblogs.asp.net/rajbk/archive/2010/03/12/joins-in-linq-to-sql.aspx
http://msdn.microsoft.com/en-us/library/bb397676.aspx

Spelling Suggestor in ASP.NET

I need to build a spelling suggester in ASP.NET. Below are my requirements.
Case 1: My list of words is not just English words but also includes some codes like AACD, ESSA, BIMER etc. I can provide such (new) words from a database.
Case 2: I also need a similar spelling suggester for a non-English language. Here, too, I can provide a list of words from a database.
Any suggestions on how to implement this are welcome.
Further, I found the following Python code on a website, which is said to return the most probable suggestion (in English, of course). If someone could translate it into C#, that would be really helpful.
import re, collections
def words(text): return re.findall('[a-z]+', text.lower())
def train(features):
    # Every word starts with a count of 1, so unseen words are not impossible.
    model = collections.defaultdict(lambda: 1)
    for f in features:
        model[f] += 1
    return model
NWORDS = train(words(open('big.txt').read()))
alphabet = 'abcdefghijklmnopqrstuvwxyz'
def edits1(word):
    # All strings one edit (delete, transpose, replace, insert) away from word.
    s = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    deletes = [a + b[1:] for a, b in s if b]
    transposes = [a + b[1] + b[0] + b[2:] for a, b in s if len(b)>1]
    replaces = [a + c + b[1:] for a, b in s for c in alphabet if b]
    inserts = [a + c + b for a, b in s for c in alphabet]
    return set(deletes + transposes + replaces + inserts)
def known_edits2(word):
    return set(e2 for e1 in edits1(word) for e2 in edits1(e1) if e2 in NWORDS)
def known(words): return set(w for w in words if w in NWORDS)
def correct(word):
    candidates = known([word]) or known(edits1(word)) or known_edits2(word) or [word]
    return max(candidates, key=NWORDS.get)
Thanks
- Raja
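To illustrate Case 1 and Case 2, here is a pared-down Python 3 sketch of the same edit-distance idea, trained on a custom word list instead of big.txt. The word list is hard-coded here as a stand-in for rows read from a database, and the function names are illustrative. Deriving the alphabet from the word list itself is an assumption of this sketch, made so that non-English characters are covered; only single-edit candidates are considered.

```python
import collections


def train(words):
    """Count how often each (lowercased) word occurs in the custom list."""
    return collections.Counter(w.lower() for w in words)


def edits1(word, alphabet):
    """All strings one delete, transpose, replace or insert away from word."""
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    deletes = [a + b[1:] for a, b in splits if b]
    transposes = [a + b[1] + b[0] + b[2:] for a, b in splits if len(b) > 1]
    replaces = [a + c + b[1:] for a, b in splits if b for c in alphabet]
    inserts = [a + c + b for a, b in splits for c in alphabet]
    return set(deletes + transposes + replaces + inserts)


def correct(word, model, alphabet):
    """Return the most frequent known word within one edit, else the input."""
    word = word.lower()
    known = lambda ws: {w for w in ws if w in model}
    candidates = known([word]) or known(edits1(word, alphabet)) or {word}
    # Ties on frequency are broken alphabetically to keep the result stable.
    return max(candidates, key=lambda w: (model[w], w))


# Stand-in for a word list loaded from a database (codes from the question).
custom_words = ["AACD", "ESSA", "BIMER"]
model = train(custom_words)
# Build the alphabet from the words themselves so any language is covered.
alphabet = "".join(sorted(set("".join(model))))

print(correct("BIMR", model, alphabet))  # bimer
print(correct("esas", model, alphabet))  # essa
```

Swapping in a different language only requires a different word list, since the alphabet follows from it.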
Another alternative is NHunspell:
NHunspell is a free open source spell checker for the .NET Framework. C# and Visual Basic sample code is available for spell checking, hyphenation and synonym lookup via thesaurus.
using (Hunspell hunspell = new Hunspell("en_us.aff", "en_us.dic"))
{
    bool correct = hunspell.Spell("Recommendation");
    var suggestions = hunspell.Suggest("Recommendatio");
    foreach (string suggestion in suggestions)
    {
        Console.WriteLine("Suggestion is: " + suggestion);
    }
}
The commercial product I work on uses the NetSpell spell checker; it has a dictionary tool that allows you to add custom dictionaries and words.
A free .NET spell checker built around a WPF text box, usable client or server side, can be seen here. It can be passed a list of words to ignore (your custom dictionary).
Full disclosure: written by yours truly, with some help from Stack Overflow of course :)
