This is an old revision of the document!
Corning Community College
CSCS1320 C/C++ Programming
~~TOC~~
To practice manipulating binary data in a C program (for fun and glory).
In addition to the new skills required on previous projects, to successfully accomplish/perform this project, the listed resources/experiences need to be consulted/achieved:
The allure of using (and learning) a programming language is to be able to effectively use it to solve problems, which in and of themselves are simulations of some process we can do in “the real world”.
In this case, we will be writing a program which will implement the mental math techniques for subtracting certain numbers from left to right– seemingly against the grain from everything we've been taught.
With the UNIX people exploring binary data, and using hex editors, and you being tasked with writing a hex editor, it only makes sense to steer some of our activities towards the manipulation of binary data as well- one cannot effectively write a hex editor if they have no idea how to work with binary data.
This project aims to ameliorate that.
Binary data merely refers to data as the computer stores it. The computer is a binary device, so its raw data (as it exists on various forms of storage and media) is often referred to as binary data, to reflect the 1s and 0s being represented.
The data we have become familiar with is textual data. We read from and write to files with the express purpose of storing text in them. And with the use of various text processing tools, we can easily manipulate these text files.
But: did you know that all text data is also binary data?
The trick to remember is that its opposite is not always tre: not all binary data is text.
The computer works in units of bytes, which these days means groups of 8-bits. C has the ability to arbitrarily read and write individual bytes of data, and we will want to make use of that to aid us in our current task.
On lab46, in /var/public/cprog/spring2015/cbf0/ is a file called out.file, a 124284 byte binary file that is essentially a scrambled JPEG file (originally called cbf0.jpg).
Your task is to write a C program to unscramble out.file and return it to its image-viewable cbf0.jpg state.
For those who are in UNIX, you are not allowed to use any of the UNIX tools to accomplish this task: you must write a C program that does all the work.
The out.file data file has been scrambled as follows:
Your task is to write a C program that does the following:
You can test the success of your program by pointing a web browser at the following URL (being mindful to substitute in your lab46 username where “username” is specified):
Note that the “~” IS very much required.
If you are successful, the image should render itself in the browser, and you should be able to recognize it (vs. it being unreadable and unrecognizable). The image is intended to be meme-like in original, and hopefully will invoke a mild sense of humor (that of course is optional).
As indicated, this task shares many attributes with the mbe1 project; in fact, the mental math process itself may be slightly simpler. That affords us the opportunity to introduce and learn about further programming optimizations, without the concurrent burden of new concepts.
Specifically, we will look at modularizing aspects of our solution, using functions, to make for a cleaner, more organized codebase.
We've been using functions all along (everytime you use fprintf() or fscanf(), for instance), but the value is not just in using pre-existing ones, but also in making our own to use.
Like variables, functions need to be declared.
We can declare them at various scopes (file/global, block/local)… if you wish for the function to be accessible by all functions within a program, you will want to declare it with a global scope.
If a particular function is only to be used by a specific function, and no others, you can opt to declare it local scope (ie within the function that will be calling it).
A function is basically a module or subroutine. It is a mini-program, focusing on the performing of a particular process.
Like a program, it takes input, does processing, and provides output.
Unlike a program, its input may not come from the keyboard, but instead from particular variables, and may not send output to the screen, but instead channel output in a way that it can be stored into a variable.
This distinctions aside, a function can in many ways be viewed as a micro- or sub-program/routine. We use functions to assist us in making our code more readable/organized/navigable.
Keeping everything in ONE file, ONE big function in that one file, is rather monolithic. In time, with sufficiently large programs, such an arrangement would become a tad unwieldy. So functions help to keep our focus short yet attentive.
To create a function we must first declare (or prototype) it. This needs to happen BEFORE said function is ever used (just as with variables- you must declare a variable before it is first used, otherwise the compiler yells).
A function, in many ways, is like a programmable variable (or is a variable with programming attached).
As such, it has a return value of a type (the function's output), a name, and parameters (input).
We see this with main()… here are two variations of a main() function declaration (technically also the start of the definition as well, in the case of main()):
int main()
In this example, we see the declaration of main() where it has a return value of int, meaning, upon completion, main() will return a value corresponding with an int data type (also in main()'s case, being the first function run, we tend to return a status code to the operating system– 0 for success, non-zero for some sort of error or deferred success).
main(), in this case, takes no parameters (just an empty set of parenthesis)… due to this, we refer to this function as a parameterless function. A function without parameters. Without input.
Now: this is technically a different form of input and output than you are used to. Input doesn't ALWAYS have to come from the keyboard, nor does output ALWAYS have to go to the screen. Input instead is desired informating being acquired for the process at hand, and output is the byproduct of performing the operation. Sometimes this means keyboard input and screen output- but not always.
Additionally, with or without parameters, we can always perform additional input (and output) within a given function, through the use of various input and output methods (like fprintf()/fscanf()).
int main(int argc, char **argv)
In this case, our main() function actually takes parameters- two, in fact:
This function takes two parameters, two pieces of input, available to us in the form of variables, by those names, of those types. We make use of them as we need to in accomplishing the program at hand.
So, when we wish to create functions of our own, we need:
For example, let us make a sum() function. Here would be a likely prototype (we'd place it above main()):
int sum(int *, int);
A function prototype (vs. its definition) will have a terminating semi-colon, as you see above.
In our case, our sum() function has the following:
Our sum() function will take an integer array (denoted by the int pointer above), and a size (the second, regular int).
Now, parameter order very much matters. In our case, an “int *” came first, followed by an “int”… we need to be mindful of this order to successfully call and use the function.
While a function prototype is technically optional (you can put the definition in place of the prototype– we just often use prototypes to further allow organization), we MUST have a function definition. This is nothing short of the code that dictates what operations the function in question will perform.
Our sum() function will be defined (below the main() function) as follows:
int sum(int *array, int size) { int result = 0; int i = 0; for (i = 0; i < size; i++) result = result + array[i]; return(result); }
Once we've declared (prototyped) and defined our function, now all we have to do is use it! When you make use of a function, we refer to it as calling. We call the function, by name, providing and required parameters, and capturing any return value as we see fit.
Here would be an example of calling the above-mentioned sum() function:
int scores[4]; int tally = 0; scores[0] = 88; scores[1] = 47; scores[2] = 96; scores[3] = 73; tally = sum(scores, 4);
Note, that it is rather important to match the type and order of parameters. Due to the nature of the array (especially the form of array declaration) used, certain pointer-related details are being hidden from us, giving somewhat of a false impression. Further discussion about pointers will begin to shed light on that.
It is your task to write a program that obtains a long integer value from the user, and processes that single value into separate array elements (one digit per array element). Determining the number of digits, you are to perform this “all from nine, last from ten” subtraction method on the number using array transactions, displaying a visual representation of the problem being solved to STDOUT.
Your program should:
I might suggest the following function prototypes:
unsigned char *longint2array(unsigned long int); void printarray(unsigned char *, unsigned char); unsigned char *allfromnine(unsigned char *);
Some questions to contemplate:
An example of your program in action:
lab46:~/src/cprog/afn0$ ./afn0 Enter value: 31415926535897 Digits detected: 14 100000000000000 - 31415926535897 --------------- 68584073464102 lab46:~/src/cprog/afn0$
To successfully complete this project, the following criteria must be met:
To submit this program to me using the submit tool, run the following command at your lab46 prompt:
$ submit cprog afn0 afn0.c Submitting cprog project "afn0": -> afn0.c(OK) SUCCESSFULLY SUBMITTED
You should get some sort of confirmation indicating successful submission if all went according to plan. If not, check for typos and or locational mismatches.