Building CUDA-aware Open MPI on Ubuntu 12.04 cannot find cuda.h


Tag: ubuntu,cuda,mpi

I am building Open MPI 1.8.5 on Ubuntu 12.04 with CUDA 6.5 installed and tested with the default samples. I intend to run it on a single node with the following configuration:

Dell Precision T7400
Dual Xeon X5450
Nvidia GT730/Tesla C1060

The configure command issued was

    $ ./configure --prefix=/usr --with-cuda=/usr/local/cuda

In the generated config.log, it is clear that the configure script was not able to find cuda.h and cuda_runtime_api.h in /usr/local/cuda/include, even though both files exist there.

For cuda.h:

    configure:73774: checking cuda.h usability
    configure:73774: gcc -std=gnu99 -c -O3 -DNDEBUG    conftest.c >&5
    conftest.c:645:18: fatal error: cuda.h: No such file or directory
    compilation terminated.
    configure:73774: $? = 1
    configure: failed program was:
    | /* confdefs.h */

For cuda_runtime_api.h:

    configure:73857: checking cuda_runtime_api.h presence
    configure:73857: gcc -E   conftest.c
    conftest.c:612:30: fatal error: cuda_runtime_api.h: No such file or directory
    compilation terminated.
    configure:73857: $? = 1
    configure: failed program was:
    | /* confdefs.h */
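A quick way to confirm that both headers really are present under the directory passed to --with-cuda is a small check like the one below. This is only a sketch: the function name `check_cuda_headers` is made up for illustration, and the path in the usage comment is the one from the configure command above.

```shell
# Hypothetical helper (not part of Open MPI or CUDA): report whether the two
# headers that configure checks for exist under a given include directory.
check_cuda_headers() {
    inc_dir="$1"
    status=0
    for header in cuda.h cuda_runtime_api.h; do
        if [ -f "$inc_dir/$header" ]; then
            echo "found:   $inc_dir/$header"
        else
            echo "missing: $inc_dir/$header"
            status=1
        fi
    done
    return "$status"
}

# Usage, assuming the install location from the configure command:
#   check_cuda_headers /usr/local/cuda/include
```

Note that the gcc command lines recorded in the log excerpts carry no -I option, so the compiler searches only its default include directories, whether or not the files exist under /usr/local/cuda/include.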

I tried changing the path to the version-specific directory, i.e. /usr/local/cuda-6.5/cuda, but the same error was thrown.

I tried to proceed to install, and the ompi_info gave


Has anybody had a similar experience and can help me out? Many thanks!


OK, I think I fixed the problem. conftest.c seems to look for cuda.h in /usr/include instead of the expected /usr/local/cuda/include. The problem was solved once I created soft links to cuda.h and cuda_runtime_api.h there.
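The soft-link workaround can be sketched roughly as follows. The helper name `link_cuda_headers` is hypothetical, and the paths in the comments are assumptions taken from the question (/usr/local/cuda/include as the source, /usr/include as the destination); adjust them to the actual install.

```shell
# Hypothetical helper: symlink the two CUDA headers from the CUDA include
# directory into a directory on the compiler's default search path.
link_cuda_headers() {
    src_dir="$1"   # e.g. /usr/local/cuda/include
    dest_dir="$2"  # e.g. /usr/include
    for header in cuda.h cuda_runtime_api.h; do
        ln -s "$src_dir/$header" "$dest_dir/$header" || return 1
    done
}

# Writing to /usr/include normally requires root, so the equivalent direct
# commands would be along these lines (paths assumed from the question):
#   sudo ln -s /usr/local/cuda/include/cuda.h /usr/include/cuda.h
#   sudo ln -s /usr/local/cuda/include/cuda_runtime_api.h /usr/include/cuda_runtime_api.h
```

An alternative that avoids touching /usr/include might be to pass the include directory to configure through the standard autoconf CPPFLAGS variable, for example CPPFLAGS=-I/usr/local/cuda/include. Whether that alone is sufficient for this Open MPI version is an assumption, not something verified here.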


Can't install Composer on Ubuntu behind proxy

I'm trying to install Composer, in order to use Laravel, but I'm behind the company proxy. The proxy is already configured in the system, so wget --proxy-user=<my_user_name> --proxy-password=<my_password> https://getcomposer.org/installer works (curl doesn't!), and I get the 270kB "installer" file. Next, I'm trying to run php installer as the manual says,...

In ubuntu unable to write file in specified directory using java

While trying to write file in specified directory i am getting exception. Java code :- public void jsonToYaml(JSONObject json, String studioName) throws JSONException, org.codehaus.jettison.json.JSONException, IOException { Yaml.dump(Yaml.dump(JsonToMap.jsonToMap(json)), new File("config.yml")); BufferedReader br = new BufferedReader(new FileReader("config.yml")); String line; studioName = studioName.toLowerCase(); File writeFile = new File("sudo /var/iprotecs/idns2.0","" + studioName + ".yaml");...

Write Access for user on all repos on Gitolite

I'm trying to add access to read, write and create new repos from my local to a gitolite server. I have the following config on my gitolite server, but it doesn't want to let me push to a new repo: repo @all RW+ = git repo gitolite-admin RW+ = git...

Error in reading Ubuntu 14.04 mouse event file (/dev/input/event3) with Java programming

I want to handle mouse event in Linux terminal via java programming. I wrote two program via c++ and java that they do same process. when i use c++ programming to open and read file ("/dev/input/event3"-mouse event file), there is no problem while running executable file. (Ubuntu 14.04 terminal and...

CUDA cuBlasGetmatrix / cublasSetMatrix fails | Explanation of arguments

I've attempted to copy the matrix [1 2 3 4 ; 5 6 7 8 ; 9 10 11 12 ] stored in column-major format as x, by first copying it to a matrix in an NVIDIA GPU d_x using cublasSetMatrix, and then copying d_x to y using cublasGetMatrix(). #include<stdio.h>...

'an illegal memory access' when trying to write to a 2D array allocated using cudaMalloc3D

I am trying to allocate and copy memory of a flattened 2D array on to the device using cudaMalloc3D to test the performance of cudaMalloc3D. But when I try to write to the array from the kernel it throws 'an illegal memory access was encountered' exception. The program runs fine...

how to generalize square matrix multiplication to handle arbitrary dimensions

I have written this program and I am having some trouble understanding how to use multiple blocks by using dim3 variable in the kernel call line. This code works fine when I am doing 1000*1000 matrix multiplication, but not getting correct answer for lower dimensions like 100*100 , 200*200. #include...

How do you build the example CUDA Thrust device sort?

I am trying to build and run the Thrust example code in Visual Studio 2010 with the latest version (7.0) of CUDA and the Thrust install that comes with it. I cannot get the example code to build and run. By eliminating parts of the code, I found the problem...

issues with installing newer cabal version for haskell vim now

I would like to install this vim plugin: https://github.com/begriffs/haskell-vim-now When trying to run the suggested installation script: curl -o - https://raw.githubusercontent.com/begriffs/haskell-vim-now/master/install.sh | bash I get: --- Cabal version 1.18 or later is required. Aborting. I then try to install a newer version of cabal: [email protected]:~/Downloads/cabal-install-$ ./bootstrap.sh The response I get:...

Kill command in UNIX

If I execute the Following command all the process will be killed and the account is logged out. Command: kill -9 -1 It is similar to the logout command. In Unix, no such process having the pid as "-1". So, what is the reason for this result?...

JFrame wrong location with Ubuntu (Unity ?)

It seems that there is a bug with Ubuntu (maybe only unity). The decoration of the JFrame is taken into account for getLocation() and getSize(), but not for setLocation() and setSize(). This leads to weird behaviour. For instance, if you use pack() after the frame is displayed and the dimensions...

Tesla k20m interoperability with Direct3D 11

I would like to know if I can work with Nvidia Tesla K20 and Direct3D 11? I'd like to render an image using Direct3D, Then process the rendered image with CUDA, [ I know how to work out the CUDA interoperability]. Tesla k20 doesn't have a display adapter (physically remote...

Error installing gem twitter on Ubuntu 15.04

Am trying to install twitter gem on Ubuntu 15.04 and this error keeps popping up gem install twitter Building native extensions. This could take a while... ERROR: Error installing twitter: ERROR: Failed to build gem native extension. /usr/bin/ruby2.1 extconf.rb mkmf.rb can't find header files for ruby at /usr/lib/ruby/include/ruby.h extconf failed,...

Would using Vagrant be overkill? [on hold]

I'm a developer-hobbyist running Windows 8.1 on a Yoga 2 Pro. I mostly do Python/Django work but I think I'm gonna pick up Ruby soon. The thing is, Windows always seems to be the limiting factor for any project I want to pick up. Last time I tried to install...

Why does Hyper-Q selectively overlap async HtoD and DtoH transfer on my cc5.2 hardware?

There's an old Parallel ForAll blog post that demonstrates using streams and async memcpys to generate overlap between kernels and memcpys, and between HtoD and DtoH memcpys. So I ran the full Async sample given on my GTX Titan X, and here's the result: http://i.stack.imgur.com/rT676.png As you can see, when...

How do I respond to a prompt for password in a shell script?

I'm writing a shell script to set a VNC password using vncpasswd. The only way to use vncpasswd is in interactive mode (enter password, return, confirm password, return). How can I respond to the prompts in my shell script so I can set the password automatically? (i.e. non-interactive). Thanks! Chris....

Understanding Memory Replays and In-Flight Requests

I'm trying to understand how a matrix transpose can be faster reading naively from columns vs. rows. (example is from Professional CUDA C Programming) The matrix is in memory by row, i.e. (0,1),(0,2),(0,3)...(1,1),(1,2) __global__ void transposeNaiveCol(float *out, float *in, const int nx, const int ny) { unsigned int ix =...

How to load data in global memory into shared memory SAFELY in CUDA?

My kernel: __global__ void myKernel(float * devData, float * devVec, float * devStrFac, int Natom, int vecNo) { extern __shared__ float sdata[]; int idx = blockIdx.x * blockDim.x + threadIdx.x; float qx=devVec[3*idx]; float qy=devVec[3*idx+1]; float qz=devVec[3*idx+2]; __syncthreads();//sync_1 float c=0.0,s=0.0; for (int iatom=0; iatom<Natom; iatom += blockDim.x) { float rtx =...

Boost unit test dynamic linking on Ubuntu

I am trying to build a unit test using Boost's unit test framework. I would like to dynamically link test suite libraries with the auto generated test module that Boost provides. Here is the basic construction I've been using: test_main.cpp: #define BOOST_TEST_DYN_LINK #define BOOST_TEST_MAIN #include <boost/test/unit_test.hpp> lib_case.cpp: #define BOOST_TEST_DYN_LINK #include...

Is it possible to dump the core but not exit the process?

I want to be able to generate a core dump but not exit the process afterwards. I don't need it to continue execution, just not die. This is a C++ Ubuntu process. I believe I'm dumping the core in a pretty standard way: I catch the offending signal via setting...

Can not increase max_open_files for Mysql max-connections in Ubuntu 15

I am running this version of Mysql Ver 14.14 Distrib 5.6.24, for debian-linux-gnu (x86_64) On this version of Ubuntu Distributor ID: Ubuntu Description: Ubuntu 15.04 Release: 15.04 Codename: vivid This is the config I set for Mysql: key_buffer_size = 16M max_allowed_packet = 16M thread_stack = 192K thread_cache_size = 8 innodb_buffer_pool_size=20G...

Ubuntu 14.04 - An error occurred while installing pg (0.18.2), and Bundler cannot continue

This issue doesn't let me go ahead and I don't know whether it's possible for me to deploy my Rails App ever on Heroku. When I try bundle install by having gem 'pg' in my Gemfile it gives following error. An error occurred while installing pg (0.18.2), and Bundler cannot...

How do I make all files executable that have the file extension .cgi?

My server downloads its files from git hub to make sure they are up to date and make it easier for me to edit the files. I have set up a cron job that will update the files every few minutes. However I am having a problem as the CGI...

Update a D3D9 texture from CUDA

I’m working on a prototype that integrates WPF, Direct3D9 (using Microsoft’s D3DImage WPF class), and CUDA (I need to be able to generate a texture for the D3DImage on the GPU). The problem is, CUDA doesn’t update my texture. No error codes are returned, the texture just stays unchanged. Even...

Protobuf cannot be linked on ubuntu

I try to use protobuf but somehow the linking fails (here just snippet): Linking CXX executable app CMakeFiles/app.dir/msg.pb.cc.o: In function `evoswarm::protobuf_AssignDesc_a_5fto_5fb_2eproto()': msg.pb.cc:(.text+0x133): undefined reference to `google::protobuf::internal::GeneratedMessageReflection::NewGeneratedMessageReflection(google::protobuf::Descriptor const*, google::protobuf::Message const*, int const*, int, int, int, int, int, int)' msg.pb.cc:(.text+0x190): undefined reference to...

Using a data pointer with CUDA (and integrated memory)

I am using a board with integrated gpu and cpu memory. I am also using an external matrix library (Blitz++). I would like to be able to grab the pointer to my data from the matrix object and pass it into a cuda kernel. After doing some digging, it sounds...

Zip all subdirectories using python

I am attempting to create a script that zips all of the subdirectories of a folder, then deletes the folders which have now been zipped import shutil import os loc = "foldertzipfilesin" path = "/whereparentis/" + loc + "/" dirs = os.listdir( path ) for file in dirs: name =...

Unable to connect to mariadb database server with qt 4.8.5 and Ubuntu 12.04

I use the following code to connect to a MySQL server database. QSqlDatabase db_Server = QSqlDatabase::database("Test"); //find mysql driver db_Server = QSqlDatabase::addDatabase("QMYSQL","Test"); db_Server.setHostName("188.**.***.***"); db_Server.setPort(3306); db_Server.setDatabaseName("Test"); db_Server.setUserName("Test"); db_Server.setPassword("*********"); bool ret = db_Server.open(); if(ret) qDebug() << "Database open" else qDebug() << db_Server.lastError().text(); Lately they changed the server to mariadb and I assumed...

Ubuntu down load a package and all of its dependencies without installing them

I need to download a package and all of its dependencies without installing any of them. I'm looking for a command like apt-get -R --download-only install package-name Or any solution that would produce the same result. Based on my research I could not find a solution that produces this and...

pyreport LaTeX formulae not working

I'm trying to create a HTML report using pyreport and it works up to the single point, that the LaTeX formulae are not generated. Here is the input file I use for testing: #$ This is \LaTeX : $c = 2\cdot(a+b)$ Than I run pyreport -l -t html --verbose file.py,...

Can an unsigned long long int be used to store the output from clock64()?

I need to update a global array storing clock64() from different threads atomically. All of the atomic functions in CUDA support only unsigned for long long int sizes. But the return type of clock64() is signed. Is it safe to store the output from clock64() in an unsigned?

C++ Ubuntu select() if serial interface has data on asynchronous read

I´m writing an asynchronous serial data reader class for Ubuntu using C++ and termios and I´m facing difficulties checking is there is data available. Here is my code: #include <iostream> #include <string> #include <sstream> #include <vector> #include <stdio.h> #include <fcntl.h> #include <unistd.h> #include <termios.h> class MySerialClass { public: MySerialClass(std::string port);...

cudaMalloc vs cudaMalloc3D performance for a 2D array

I want to know the impact on performance when using cudaMalloc or cudaMalloc3D when allocating, copying and accessing memory for a 2D array. I have code that I tried to test the run time on where on one I use cudaMalloc and on the other cudaMalloc3D. I have included the...

What is version of cuda for nvidia 304.125

I am using ubuntu 14.04. I want to install CUDA. But I don't know which version is good for my laptop. I trace my drive that is $cat /proc/driver/nvidia/version NVRM version: NVIDIA UNIX x86_64 Kernel Module 304.125 Mon Dec 1 19:58:28 PST 2014 GCC version: gcc version 4.8.2 (Ubuntu 4.8.2-19ubuntu1)...

PHP Autoloader doesn't work on Ubuntu production server

I have two environments in which I'm developing my API (PHP based): Local development: Mac OS Yosemite - running PHP 5.5.20 Production server: Ubuntu server - running PHP 5.5.9 My code uses Composer for autoloading as follows: { "require": { "facebook/php-sdk": "@stable", "everyman/neo4jphp": "dev-master", "predis/predis": "1.0.1", "aws/aws-sdk-php": "2.*" }, "autoload":...

Creating a docker swarm cluster in Vagrant

I'm trying to create a swarm cluster of different Ubuntu VMs running in Vagrant. These have Docker enabled via the Vagrantfile that boots them. Of the three VMs, I started the swarm cluster on one machine in the following way: docker pull swarm docker run --rm swarm create This...

How to redirect standard output to a file — what's wrong with this code?

I'm using a C program on my raspberry pi2 with a 433mhz receiver to read codes that are transmitted. This program sniffing 433mhz codes. To run it, I use the following command: sudo ./RFSniffer and if a code is found, the program displays in the console something like : Received...

How does one install MarkLogic 8 on Ubuntu 14.04?

What are the steps to install MarkLogic 8 on Ubuntu 14.04?

Error execute studio.sh unrecognized vm option 'MaxPermSize=350m' on Ubuntu 14.04

Error to Execute/Install studio.sh on Ubuntu: [email protected]:~$ cd android-studio/bin [email protected]:~/android-studio/bin$ ./studio.sh Unrecognized VM option 'MaxPermSize=350m' Error: Could not create the Java Virtual Machine. Error: A fatal exception has occurred. Program will exit. And with some knowledge after searching on search engines, I open the studio.vmoptions file in edit mode and...

connect to mysql database which is in ubuntu server

I am using below code to connect MySQL database in PHP. try { shell_exec("ssh -f -L 3307: [email protected]_ip sleep 60 >> logfile"); $this->_conn = $this->dbh = new PDO('mysql:host=;dbname=my_db', DB_USER, DB_PASS); $this->dbh->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_EXCEPTION); } catch (PDOException $e) { die("Couldn't connect to database. Please try again!"); } I want to direct connect...

how to ssh run a tail and then send data to a mysql database

This code SSHes to a remote host and then runs a tail command there. I would now like to pass that tailed data into a MySQL database using a local script called insertPerfmon.sh. How do I pass data generated in an SSH session into the local shell script insertPerfmon.sh? The local...

Python3 input() error: can't initialize sys standard streams

I'm running Python 3.4.3 on Ubuntu 15.04 and just encountered a very strange problem when trying to use the input() function. To isolate the problem I've created a file called test.py containing: print(input()) When running it, I receive this error: $ python3 test.py Fatal Python error: Py_Initialize: can't initialize sys...

Cannot set PHP include_path

I have uploaded all my files in var/www/html and in one of my php files I have this line : require_once('libraries/stripe/init.php'); the structure of my folders are list this: www -html/ -libraries -> Stripe -> init.php -register.php I keep getting this error message: Warning: require_once(libraries/Stripe/init.php): failed to open stream: No...

Pylucene 4.9.0 Ubuntu 14.04 Installation ImportError

I've been trying to install Pylucene on my Mac for a little over a week, and have given up on that in favor of trying to install it with Ubuntu through a virtual machine. I thought the installation process had gone well, so I fired up Python in the terminal...

Is prefix scan CUDA sample code in gpugems3 correct?

I've written a piece of code to call the kernel in gpugem3 but the results that I got is a bunch of negative numbers instead of prefix scan. I'm wondering if my kernel call is wrong or there is something wrong with the gpugem3 code? here is my code: #include...

Pretty URLs aren't working after upgrade to Mediawiki 1.24.2

So I am moving a Mediawiki site of mine to a new server. The version 1.20.3 worked fine on the old server running Ubuntu 12.04. However, when I copied everything over to my new server running Ubuntu 14.04 it didn't. So after messing with it for a while I decided...

Bash: Loop through file and read substring as argument, execute multiple instances

How it is now I currently have a script running under windows that frequently invokes recursive file trees from a list of servers. I use an AutoIt (job manager) script to execute 30 parallel instances of lftp (still windows), doing this: lftp -e "find .; exit" <serveraddr> The file used...

Access binaries inside docker

I am using Meteor and the Meteor Up package to push a bundle to the server. It uses Docker. The problem is that I cannot access graphicsmagick or imagemagick from inside Docker to use it in my app. However it is installed on the server and I can access it when...

How can I pass a struct to a kernel in JCuda

I have already looked at this http://www.javacodegeeks.com/2011/10/gpgpu-with-jcuda-good-bad-and-ugly.html which says I must modify my kernel to take only single dimensional arrays. However I refuse to believe that it is impossible to create a struct and copy it to device memory in JCuda. I would imagine the usual implementation would be to...