FAQ Database Discussion Community


unix word count tool counting characters across multiple lines

unix,word-count,wc
How would I apply the word count tool to a task of this nature: Д е с я т ь д н е й I want to know how many characters appear but I want to ignore the white space in between the characters. How can I specify that in...

HADOOP - Problems copying text files into HDFS

hadoop,mapreduce,hdfs,word-count
I am implementing Hadoop single-node-cluster following the prominent Michael Noll Tutorial. The cluster is working, checking with jps shows that all components are running after execution of start-all.sh. I face a problem reproducing the wordcount-example using some downloaded texts. I downloaded the files in /tmp/gutenberg and checked if they are...

PHP Word Count Function

php,arrays,string,word-count
I am attempting to write my first custom function. I understand that there are other functions out there that do the same thing that this does, but this one is mine. I have the function wrote, but I do not understand char_list as it pertains to functions and cannot figure...

Running my own version of WordCount.java in Hadoop-2.6.0

java,hadoop,jar,mapreduce,word-count
I am trying to create my own version of wordcount and execute it. For that, I am trying to create the wordcount.jar by executing the following command (as described here http://cs.smith.edu/dftwiki/index.php/Hadoop_Tutorial_1_--_Running_WordCount for previous releases than Hadoop-2.*): javac -classpath /usr/local/hadoop-2.6.0/share/hadoop/common/*:/usr/local/hadoop-2.6.0/share/hadoop/mapreduce/* -d wordcount_classes/ WordCount.java jar -cvf wordcount.jar -C wordcount_classes/ . The problem...

Error while submiting topology to storm clustre

java,storm,word-count,nimbus,topology
I am running a storm topology .This is the basic wordcount topology.I am using text file as the source and storm for processing the data.While submitting the i am facing these issues.I am very new to storm.Please suggest me the changes i need to do in the following code.Thanks in...

How to get a “fieldcount” (like wordcount) on CouchDB/Cloudant?

javascript,mapreduce,couchdb,word-count,cloudant
Trying to get a count of fields, just like the classic word count example. I thought this would be trivial... but I got this useless result... {"rows":[ {"key":null,"value":212785214} ]} How can I get what I wanted... an inventory of all fields used in my documents, with a count of how...

Word count program with two input files and single output file

java,hadoop,mapreduce,word-count
I am new to Hadoop. I have done word count program with single input file and single output file. Now I want to take 2 files as input and write that output to a single file. I tried like this: FileInputFormat.setInputPaths(conf, new Path(args[0]), new Path(args[1])); FileOutputFormat.setOutputPath(conf, new Path(args[2])); This is...

Hadoop examples beside the word count

hadoop,word-count
I like to learn Hadoop applications in the real world scenarios. Currently most of the example only cover the word count problem, and no any example on industrial use case. Are there other Hadoop examples, or Hadoop tutorials out there, that solve other problem beside the word count problem?...

Apply wordcount on individual lines in a mapreduce job

java,hadoop,mapreduce,word-count
I have an input file like LOW LOW HIGH LOW LOW LOW HIGH MOD LOW LOW HIGH LOW HIGH HIGH HIGH LOW LOW LOW LOW LOW . . . . . . . . . . for which i would like to have the result as follows: Genuine Moderate Not_genuine...

JQuery loop on multiple items of the same CSS class

jquery,loops,each,keyup,word-count
I have multiple fields of the same class. The word count shows up correctly, but changes the value of every word count when one field has received input. Additionally, I'd expect the field to show a word count of 100 if no input exists, but it shows 0. Suggestions on...

Caused by: java.lang.ClassNotFoundException: org.apache.hadoop.fs.CanSetDropBehind issue in ecllipse

maven,hadoop,apache-spark,word-count
I have the below spark word count program : package com.sample.spark; import java.util.Arrays; import java.util.List; import java.util.Map; import org.apache.spark.SparkConf; import org.apache.spark.api.java.*; import org.apache.spark.api.java.function.FlatMapFunction; import org.apache.spark.api.java.function.Function; import org.apache.spark.api.java.function.Function2; import org.apache.spark.api.java.function.PairFlatMapFunction; import org.apache.spark.api.java.function.PairFunction; import...

Hadoop - word count per node

java,hadoop,mapreduce,word-count
I am implementing a customized version of WordCount.java in Hadoop where I am interested in outputting the word counts per node. For example, given text: FindMe FindMe ..... .... .... .. more big text ... FindMe FindMe FindMe FindMe node01: 2 FindMe node02: 3 Here is a snippet from my...

reduce function in hadoop doesn't work

java,hadoop,mapreduce,word-count
I learning hadoop. I wrote simple program in Java. Program have to counts words (and creates file with words and number of times each word appears), but program only creates a file with all words, and number "1" near every word. It's look like : rmd 1 rmd 1 rmd...

Favorite tool for word/phrase counting

full-text-search,text-mining,data-analysis,word-count,text-analysis
I am looking for a tool that performs counting of words and, more importantly, phrases, in large amounts of open-ended text responses. I need the ability to exclude certain words (a, the, and, etc.) as well. I am aware of a few tools that do this: - http://www.mywritertools.com/default.asp - http://www.hermetic.ch/wfca/wfca.htm...

Count recurrent words in two files

java,word-count
I have a code, which can count word occurences in a file. I would like to use this with 2 files and display recurrent(which both files contains) words in a separated table. What is your idea, how is it possible to use it with 2 files? while ((inputLine = bufferedReader.readLine())...